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PREFACE 


In 1975 I gave a course in partial differential equations (PDE) at the Uni- 
versity of Washington to an audience consisting of graduate students who 
had taken the standard first-year analysis courses but who had little back- 
ground in PDE. Accordingly, it focused on basic classical results in PDE 
but aimed in the direction of the recent developments and made fairly free 
use of the techniques of real and complex analysis. The roughly polished 
notes for that course constituted the first edition of this book, which has 
enjoyed some success for the past two decades as a “modern” introduction 
to PDE. From time to time, however, my conscience has nagged me to 
make some revisions — to clean some things up, add more exercises, and 
include some material on pseudodifferential operators. 

Meanwhile, in 1981 I gave another course in Fourier methods in PDE 
for the Programme in Applications of Mathematics at the Tata Institute 
for Fundamental Research in Bangalore, the notes for which were published 
in the Tata Lectures series under the title Lectures on Partial Differential 
Equations. They included applications of Fourier analysis to the study of 
constant coefficient equations (especially the Laplace, heat, and wave equa- 
tions) and an introduction to pseudodifferential operators and Calderdn- 
Zygmund singular integral operators. These notes were found useful by a 
number of people, but they went out of print after a few years. 

Out of all this has emerged the present book. Its intended audience is 
the same as that of the first edition: students who are conversant with real 
analysis (the Lebesgue integral, L? spaces, rudiments of Banach and Hilbert 
space theory), basic complex analysis (power series and contour integrals), 
and the big theorems of advanced calculus (the divergence theorem, the 
implicit function theorem, etc.). Its aim is also the same as that of the first 
edition: to present some basic classical results in a modern setting and 
to develop some aspects of the newer theory to a point where the student 
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will be equipped to read more advanced treatises. It consists essentially of 
the union of the first edition and the Tata notes, with the omission of the 
LP theory of singular integrals (for which the reader is referred to Stein’s 
classic book [45]) and the addition of quite a few exercises. 

Apart from the exercises, the main substantive changes from the first 
edition to this one are as follows. 


¢ §1F has been expanded to include the full Malgrange-Fhrenpreis theo- 
rem and the relation between smoothness of fundamental solutions and 
hypoellipticity, which simplifies the discussion at a few later points. 

¢ Chapter 2 now begins with a brief new section on symmetry properties 
of the Laplacian. 

« The discussion of the equation Au = f in §2C (formerly §2B) has been 
expanded to include the full Hdélder regularity theorem (and, as a by- 
product, the continuity of singular integrals on Hélder spaces). 

¢ The solution of the Dirichlet problem in a half-space (§2G) is now done 
in a way more closely related to the preceding sections, and the Fourier- 
analytic derivation has been moved to §4B. 

¢ I have corrected a serious error in the treatment of the two-dimensional 
case in §3E. I am indebted to Leon Greenberg for sending me an analysis 
of the error and suggesting Proposition (3.36b) as a way to fix it. 

¢ The discussion of functions of the Laplacian in the old §4A has been 
expanded and given its own section, §4B. 

e Chapter 5 contains a new section (§5D) on the Fourier analysis of the 
wave equation. 

« The first section of Chapter 6 has been split in two and expanded to 
include the interpolation theorem for operators on Sobolev spaces and 
the local coordinate invariance of Sobolev spaces. 

e A new section (§6D) has been added to present I]érmander’s charac- 
terization of hypoelliptic operators with constant coefficients. 

¢ Chapter 8, on pseudodifferential operators, is entirely new. 


In addition to these items, I have done a fair amount of rewriting in 
order to improve the exposition. I have also made a few changes in notation 
~—- most notably, the substitution of (f | g) for (f, g) to denote the Hermitian 
inner product f fg, as distinguished from the bilinear pairing (f,g) = f fg. 
(I have sworn off using parentheses, perhaps the most overworked symbols 
in mathematics, to denote inner products.) I call the reader’s attention to 
the existence of an index of symbols as well as a regular index at the back 
of the book. 


Preface xi 


The bias toward elliptic equations in the first edition is equally evident 
here. I feel a little guilty about not including more on hyperbolic equations, 
but that is a subject for another book by another author. 

The discussions of elliptic regularity in §6C and §7F and of Garding’s 
inequality in §7D may look a little old-fashioned now, as the machinery of 
pseudodifferential operators has come to be accepted as the “right” way 
to obtain these results. Indeed, I rederive (and generalize) Garding’s in- 
equality and the local regularity theorem by this method in §8F. However, 
I think the “low-tech” arguments in the earlier sections are also worth re- 
taining. They provide the quickest proofs when one starts from scratch, 
and they show that the results are really of a fairly elementary nature. 

I have revised and updated the bibliography, but it remains rather 
short and quite unscholarly. Wherever possible, ] have preferred to give 
references to expository books and articles rather than to research papers, 
of which only a few are cited. 

In the preface to the first edition I expressed my gratitude to my teach- 
ers J. J. Kohn and E. M. Stein, who influenced my point of view on much 
of the material contained therein. The same sentiment applies equally to 
the present work. 


Gerald B. Folland 
Seattle, March 1995 


Chapter 0 
PRELIMINARIES 


The purpose of this chapter is to fix some terminology that will be used 
throughout the book, and to present a few analytical tools which are not 
included in the prerequisites. It is intended mainly as a reference rather 
than as a systematic text. 


A. Notations and Definitions 


Points and sets in Euclidean space 


IR will denote the real numbers, C the complex numbers. We will be 
working in R®, and n will always denote the dimension. Points in R” will 
generally be denoted by z,y,&,n; the coordinates of x are (21,...,Zn). 
Occasionally 21, 22,... will denote a sequence of points in R" rather than 
coordinates, but this will always be clear from the context. Once in a while 
there will be some confusion as to whether (21,...,2n) denotes a point in 
R” or the n-tuple of coordinate functions on R". However, it would be 
too troublesome to adopt systematically a more precise notation; readers 
should consider themselves warned that this ambiguity will arise when we 
consider coordinate systems other than the standard one. 

If U is a subset of R", U will denote its closure and @U its boundary. 
The word domain will be used to mean an open set 2 C R", not necessarily 
connected, such that €Q = 4(R"\ 9). (That is, all the boundary points of 
Q are “accessible from the outside.”) 

If z and y are points of R” or (”, we set 


n 
z-y= > ay, 
1 
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so the Euclidean norm of z is given by 
jz) = (2-2)? (= (z-2)/? if x is real.) 


We use the following notation for spheres and (open) balls: if z € R" and 
r>0, 

S-(z) = {y ER": |z-yl=r}, 

B,(z) = {y ER": |zx—y| <r}. 


Measures and integrals 


The integral of a function f over a subset 2 of R” with respect to 
Lebesgue measure will be denoted by [., f(z) dz or simply by f, f. If no 
subscript occurs on the integral sign, the region of integration is understood 
to be R". If S is a smooth hypersurface (see the next section), the natural 
Euclidean surface measure on S will be denoted by do; thus the integral of 
f over Sis f, f(x) do(r), or f, f do, or just f, f. The meaning of do thus 
depends on S, but this will cause no confusion. 

If f and g are functions whose product is integrable on IR", we shall 
sometimes write 


na= fsa (fla = ff, 


where § is the complex conjugate of g. The Hermitian pairing (f{g) will 
be used only when we are working with the Hilbert space L? or a variant 
of it, whereas the bilinear pairing (f,g) will be used more generally. 


Multi-indices and derivatives 


An n-tuple a = (aj,...@,) of nonnegative integers will be called a 
multi-index. We define 


n 
Jo| = y a;, a! = ay!a2!---an!, 
r 


and for z € R", 


1 a2 


e% = zy 2p a 


‘@,". 


We will generally use the shorthand 


a] 
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for derivatives on IR". Higher-order derivatives are then conveniently ex- 
pressed by multi-indices: 


n az fox} 
=e) =a ae 
‘1 x5 zy t+: Oza 
Note in particular that if a = 0, 8% is the identity operator. With this 
notation, it would be natural to denote by Ou the n-tuple of functions 


(A,u,...,d,u) when u is a differentiable function; however, we shall use 
instead the more common notation 


Vu = (Q\u,..., Ont). 


For our purposes, a vector field on a set Q € R" is simply an R"- 
valued function on Q. If F is a vector field on an open set 0, we define the 
directional derivative Or by 


Or =F-YV, 


that is, if u is a differentiable function on Q, 
n 

Opu(z) = F(z) - Vu(x) = x F;(z)6; u(z). 
1 


Function spaces 


If Q is a subset of R”, C(Q) will dente the space of continuous complex- 
valued functions on Q (with respect to the relative topology on 2). If Q is 
open and k is a positive integer, C*(Q) will denote the space of functions 
possessing continuous derivatives up to order k on Q, and C*({) will denote 
the space of all u € C*() such that 0%u extends continuously to the 
closure 2 for 0 < |a| < k. Also, we set C°(Q) = (Pp? C#(Q) and C°(Q) = 
ne chm). 

We next define the Holder or Lipschitz spaces C*(Q), where 2 is an 
open set and 0 < a < 1. (Here a is a real number, not a multi-index; the 
use of the letter “a” in both these contexts is standard.) C%() is the space 
of continuous functions on 2 that satisfy a locally uniform Holder condition 
with exponent a. That is, u € C%(Q) if and only if for any compact V C 2 
there is a constant c > 0 such that for all y € IR” sufficiently close to 0, 


sup |u(z + y) — u(z)| < elyl*. 
xeV 
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(Note that C1(Q) c C%(Q) for all a < 1, by the mean value theorem.) If 
k is a positive integer, C*+%(Q) will denote the set of all u € C*() such 
that 0°u € C%(Q) for all muli-indices § with |G] = k (or equivalently, with 
\8| < &; the lower-order derivatives are automatically in C1(Q) C C%(Q).). 

The support of a function u, denoted by supp u, is the complement of 
the largest open set on which u= 0. If 2 C IR", we denote by C'O°(Q) the 
space of al] C™ functions on R" whose support is compact and contained 
in Q. (In particular, if 2 is open such functions vanish near 02.) 

The space C*(IR") will be denoted simply by C*. Likewise for C™, 
Ct and C2. 

If OQ C R" is open, a function u € C™(Q) is said to be analytic in Q 
if it can be expanded in a power series about every point of 2. That is, 
u is analytic on Q if for each z € Q there exists r > 0 such that for all 


¥ € B-(z), ga 
uy) = yy TH y— 2), 
lejz0 


the series being absolutely and uniformly convergent on B,(r). When re- 
ferring to complex-analytic functions, we shall always use the word holo- 
morphic. 

The Schwartz class § = 8(IR”) is the space of all C® functions on IR" 
which, together with all their derivatives, die out faster than any power of 
x at infinity. That is, u € S if and only if u € C™ and for all multi-indices 
a and f, 

sup |z%d? u(x)| < OO. 
rer 


Big O and little o 


We occasionally employ the big and little o notation for orders of mag- 
nitude. Namely, when we are considering the behavior of functions in a 
neighborhood of a point a (which may be oo), O(f(z)) denotes any func- 
tion g(x) such that |g(x)| < C|f()| for x near a, and o( f(z)) denotes any 
function A(z) such that A(z)/ f(z) ~ 0 ast — a. 


B. Results from Advanced Calculus 


A subset 5 of IR" is called a hypersurface of class C* (1 < k < 00) if for 
every Zo € S there is an open set V C R" containing zo and a real-valued 
function ¢ € C*(V) such that V¢ is nonvanishing on SNV and 


SAV ={zrEV:¢(z) =O}. 
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In this case, by the implicit function theorem we can solve the equation 
¢(t) = 0 near zo for some coordinate 2; — for convenience, say i = n — 
to obtain 

fp = W(r1,...,2n-1) 


for some C* function w. A neighborhood of zo in S can then be mapped 
to a piece of the hyperplane zy, = 0 by the C* transformation 


z—+(2’, tn — ¥(2')) (2! = (x1,...,2n-1)). 


This same neighborhood can also be represented in parametric form as 
the image of an open set in R°~! (with coordinate 2’) under the map 


x’ — (2’, Y(2’)). 


x’ may be thought of as giving local coordinates on S near zo. 

Similar considerations apply if “C*” is replaced by “analytic.” 

With S, V, ¢ as above, the vector Vd(x) is perpendicular to S$ at x 
for every z € SCV. We shall always suppose that S is oriented, that 
is, that we have made a choice of unit vector v(x) for each z € S, varying 
continuously with x, which is perpendicular to $ at x. v(x) will be called 
the normal to S at x; clearly on SMV we have 


V¢(z) 


v(x) => *TW(a) 


Thus v is a C*-! function on S. If S is the boundary of a domain 0, we 
always choose the orientation so that v points out of 2. 

If u is a differentiable function defined near S, we can then define the 
normal derivative of u on S by 


Ousv-Vu. 


We pause to compute the normal derivative on the sphere S,(y). Since 
lines through the center of a sphere are perpendicular to the sphere, we 
have 


n 


(0.1) v(z) = = a, = * ye —yj)0; on S,(y). 


1 


We will use the following proposition several times in the sequel: 
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(0.2) Proposition. 
Let S be a compact oriented hypersurface of class C*, k > 2. There is a 
neighborhood V of § in IR" and a number € > 0 such that the map 


F(z,t)=2+tv(2) 
is a C*-} diffeomorphism of S x (—e,¢) onto V. 


Proof (sketch): F is clearly C*-!. Moreover, for each x € S§ its 
Jacobian matrix (with respect to local coordinates on S x R) at (z,0) is 
nonsingular since v is normal to S. Hence by the inverse mapping theorem, 
F can be inverted on a neighborhood W, of each (2,0) to yield a C¥~} 
map 

Fr) :W, + (SOWs) x (—€e, €x) 


for some €, > 0. Since S is compact, we can choose {xj} C S such that 
the W,, cover S, and the maps rs patch together to yield a C*~! inverse 
of F from a neighborhood V of S to S x (—€,€) where € = min; €z;. i] 


The neighborhood V in Proposition (0.2) is called a tubular neigh- 
borhood of S. It will be convenient to extend the definition of the normal 
derivative to the whole tubular neighborhood. Namely, if u is a differen- 
tiable function on V, for x € S and —e <i < € we set 


(0.3) O,u(x + tv(x)) = v(x): Vu(e + ty(z)). 


If F = (F\,..., Fn) is a differentiable vector field on a subset of IR", its 
divergence is the function 


v-F => oF. 
1 


With this terminology, we can state the form of the general Stokes formula 
that we shal] need. 


(0.4) The Divergence Theorem. 


Let 2 C R® be a bounded domain with C1 boundary S = OQ, and let F 
be a C! vector field on Q. Then 


[Fw odo = [ v-Feaz. 
Ss a 
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The proof can be found, for example, in Treves [52, §10]. 
Every 2 € R" \ {0} can be written uniquely as x = ry with r > 0 and 
y € S,(0) — namely, r = jz| and y = x/[x|. The formula z = ry is called 
the polar coordinate representation of c. Lebesgue measure is given in 
polar coordinates by 
dz =r"~1 drdo(y), 


where do is surface measure on S;(0). (See Folland [14, Theorem (2.49)].) 
For example, if0<a<b< oo and AER, we have 


i. [Pde = [ [> n-14) dp — fun se fA #—n, 
<Iz|<s 51(0) ie Cease) if \ = —n, 


where w,, is the area of $,(0) (which we shall compute shortly). As an 
immediate consequence, we have: 


(0.5) Proposition. 

The function x —»+ |z|* is integrable on a neighborhood of 0 if and only 
if \ > —n, and it is integrable outside a neighborhood of 0 if and only if 
A<=—n. 


> 


As another application of polar coordinates, we can compute what is 
probably the most important definite integral in mathematics: 


(0.6) Proposition. 
f ecm? dz=1. 


Proof: Let I, = fg enrkt dz. Since e~*!I" = J]ye ~x8} , Fubini’s 
theorem shows that J, = (J,)", or equivalently that J, = (Ip)"/?, But in 
polar coordinates, 


2n 09 2 foe) 5 0° 
n= | if e7"" rdrdo=2n f re" ar =n [ e-"’ds=1, 8 
o Jo 0 0 


This trick works because we know that the measure of 5;(0) in R? is 
2m. But now we can turn it around to compute the area wp, of S,(0) in R® 
for any n. Recall that the gamma function I°(s) is defined for Res > 0 


by 
r= | e7't?—! dt. 
0 
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One easily verifies that 
I'(s +1) = sI(s), (1) = 1, l(a) = va. 


(The first formula is obtained by integration by parts, and the last one 
reduces to (0.6) by a change of variable.) Hence, if k is a positive integer, 


T(k)=(k-1)!, Pk + 5) = (b- gk - 9) (Gv. 


(0.7) Proposition. 
The area of S;(0) in R” is 


Ann? 
n= T(n/2)" 


. ~alel? ; ; 
Proof: We integrate e~"!*!" in polar coordinates and set 5 = mr?: 


= fe e7*lzl? de= | foe ~8r? 9-1 de de 
5,(0) 


= i < 2)~1 
=un [ en) dp = staf e~* 5(7/2)~1 dg 


_ Wal(n/2) 
Qnn/2 7 


\ 


Note that, despite appearances, w,, is always a rational multiple of an 
integer power of 7. 


(0.8) Corollary. 
The volume of B,(0) in R” is 


Wn _ Qnnl2 
n~ n¥(n/2) 
Proof: Jayco) dz = Wp { mdr =w,/n. ' 


(0.9) Corollary. 
For any x € R" and any r > 0, the area of S,(x) isr 
of B,(z) is r?w,/n. 


n~1u, and the volume 
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C. Convolutions 


We begin with a general theorem about integral operators on a measure 
space (X,) which deserves to be more widely known than it is. In our 
applications, X will be either R” or a smooth hypersurface in R". 


(0.10) Generalized Young’s Inequality. 


Let (X,p) be a o-finite measure space, and let 1 < p < co and C > 0. 
Suppose K is a measurable function on X x X such that 


sup f IK(2,v)lduv) <0, sup [ IK(e, wl due) <C. 
zEX JX yeX JX 
If f € L?(X), the function Tf defined by 


TH2)= f Keys) duty) 
is well-defined almost everywhere and is in L?(X), and ||T f\lp < Cllfllp- 


Proof: Suppose 1 < p < oo, and let g be the conjugate exponent 
(p-! + q7! = 1). Then by Hélder’s inequality, 


irr s | f Ke. mldu)| : [fv allen date a 


1/ 
<our if. IX (2, yML/CoP duty) fe 


Raising both sides to the p-th power and integrating, we see by Fubini’s 
theorem that 


| ITF(2)/? du(x) < CP! | ip [K (x, wILF(W)P duly) dy(2) 
x XIX 
< CO/)+1 y Lf(u)IP du(y), 
xXx 
or, taking pth roots, 


ITfllp < CO O/ FI], = Cll fllp- 


These estimates imply, in particular, that the integral defining T’ f(x) con- 
verges absolutely a.e., so the theorem is proved for the case 1 < p < oo. 
The case p = 1 is similar but easier and requires only the hypothesis 
J \K(z,y)|du(2) < C, and the case p = oo is trivial and requires only 
the hypothesis f |K(z, y)|du(y) < C. 1 


10. Chapter 0 


In what follows, when we say L? we shall mean L?(R") unless another 
space is specified. 

Let f and g be locally integrable functions on R". The convolution 
f *g of f and g is defined by 


feg(e)= / fle—yoW) ey = / f(y)gle — v) dy = 9 * F(2), 


provided that the integrals in question exist. (The two integrals are equal 
by the change of variable y — x — y.) The basic theorem on the existence 
of convolutions is the following: 


(0.11) Young’s Inequality. 
If f € L' andg € LP (1< p< 00), thenfxge L? and ||f*gllp < UF lilly. 


Proof: Apply (0.10) with X = R" and K(z,y) = f(# — y). | 


Remark: It is obvious from IIélder’s inequality that if f € DL? and 
g © LP where p>! + q7' = 1 then f *g € L© and {If * glloo < [lFllellallp- 
From the Riesz-Thorin interpolation theorem (see Folland [14]) one can 
then deduce the following generalization of Young’s inequality: Suppose 
1<p,ar<coandp!4q7!'=r7341. If f € LY and g € IP then 
fg EL" and \\f * all, < \lfllellalle- 


The next theorem underlies one of the most important uses of convo- 
lutions. Before coming to it, we need a technical lemma. If f is a function 
on R” and zx € R", we define the function f, by 


f(y) = f(x + y). 


(0.12) Lemma. 
If 1 < p< oo and f € I, then limzo ||fz — f\lp = 0. 


Proof: If g is continuous with compact support, then g is uniformly 
continuous, so gz — g uniformly as s — 0. Since g, and g are supported 
in a common compact set for |z| < 1, it follows also that |lgz — gllp — 0. 
Now, given f € L? and ¢ > 0, choose a continuous g with compact support 
such that ||f — gllp < €/3. Then also ||fz — gzllp < €/3, so 


fe rs Filp < \Ife - Gx\lp + \lgz ae lp + ily am fllp < \Igz > g\lp + 2€/3. 


But for « sufficiently small, |lgz — gilp < €/3, so ||fz — fllp < €. i] 
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Remark: This result is false for p = co. Indeed, the condition that 
\|fc — flloo + 0 as e -+ 0 means precisely that f agrees almost everywhere 
with a uniformly continuous function. 


(0.13) Theorem. 

Suppose ¢ € L} and f(z) dx = a. For each € > 0, define the function ¢, 
by ¢(z) = e"" (e712). If f € L?, 1 <p < 00, then f* 6, af in the L? 
norm as¢€ — 0. If f € L® and f is uniformly continuous on a set V, then 
f *¢.— af uniformly on V ase — 0. 


Proof: By the change of variable « — ex we see that f ¢,(xz)dz =a 
for alle > 0. Hence, 


frbc(2)—af(2) = / [f(e—w)— F(2)]de(u) dy = / [F(o—ey)- f(2)]14(y) dy. 


If f € L? and p < oo, we apply the triangle inequality for integrals 
(Minkowski’s inequality; see Folland [14]) to obtain 


IIf *be—afllp < / If-ey ~ flle|d(w)| dy. 


But (|f_ey —fl|lp is bounded by 2||f||, and tends to zero as « — 0 for each y, 
by Lemma (0.12). The desired result therefore follows from the dominated 
convergence theorem. 

On the other hand, suppose f € L™ and f is uniformly continuous on 
V. Given 6 > 0, choose a compact set W so that Saw |¢| < 6. Then 


sup [f+ dz) af(e)| < sup [fle~ ev) — F02)I [161+ 2Ilfllos 
zeV reV, yew w 

The first term on the right tends to zero as € — 0, and 6 is arbitrary, so 

f * ¢_ tends uniformly to af on V. r] 


If ¢ € L' and f ¢(x) dz = 1, the family of functions {¢.}<>0 defined 
in Theorem (0.13) is called an approximation to the identity. What 
makes these useful is that by choosing ¢ appropriately we can get the 
functions f * ¢, to have nice properties. In particular: 


(0.14) Theorem. 
If f € L? (1 < p< co) and ¢ is in the Schwartz class §, then f * ¢ is C™ 
and 0°(f *¢) = f * 0%¢ for all multi-indices a. 
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Proof: If ¢ €§, for every bounded set V C R® we have 


sup |8°A(z — vy) SCav(l+|ul-" (ye R"). 
zc 
The function (1+ |y|)~"7! is in L* for every g by (0.5), so the integral 


f*0%(2) = ') I(u)0%6(2 — y) dy 


converges absolutely and uniformly on bounded subsets of R”. Differen- 
tiation can thus be interchanged with integration, and we conclude that 


0°(f *¢) = f + 0%. : 


We can get better results by taking ¢ € C2°. In that case we need only 
assume that f is locally integrable for f * ¢ to be well-defined, and the 
same argument as above shows that f+*+¢EC@™. 

Since the existence of nonzero functions in CS is not completely trivial, 
we pause for a moment to construct some. First, we define the function f 


Rb 
~ et ae (lt| <1), 
FO= Yo (ltl > 1). 


Then f € C2°(IR), so (x) = f(|z|?) is a nonnegative C® function on R” 
whose support is 8,(0). In particular, fp > 0,80 ¢ = / f is a function 
in C(I") with f ¢ = 1. It now follows that there are lots of functions in 


Ce: 


(0.15) Lemma. 
If f is supported in V and g is supported in W, then f *g is supported in 
{aty:réEV, yew}. 


Proof: Exercise. ' 


(0.16) Theorem. 
Co ts dense in L? for 1 <p < oo. 


Proof: Choose ¢ € Ce with f ¢ = 1, and define ¢, as in Theorem 
(0.13). If f € L? has compact support, it follows from (0.14) and (0.15) 
that f +. € Co° and from (0.13) that f *¢, — f in the L? norm. But L? 


functions with compact support are dense in L?, so we are done. | 


Another useful construction is the following: 
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(0.17) Theorem. 
Suppose V C R® is compact, 2 C R® is open, and V C Q. Then there 
exists f € CS°(Q) such that f = 1 on V and0 < f < 1 everywhere. 


Proof: Let 6 = inf{|z—y|: 2 EV, y € Q}. If Q=R", let 6 = 1.) 
By our assumptions on V and Q, 6 > 0. Let 


U = {x:|z—y| < 46 for some y€ V}. 


Then V CU and U CQ. Let x be the characteristic function of U, and 
choose a nonnegative ¢ € C'2°(Bs/2(0)) such that f¢ = 1. Then we can 
take f = x + ¢; the simple verification is left to the reader. t 


We can now prove the existence of “partitions of unity.” We state the 
following results only for compact sets, which is all we need, but they can 
be generalized. 


(0.18) Lemma. 
Let K C IR" be compact and let V,,..., Vw be open sets with K C Ny. 
Then there exist open sets W;,..., Ww with W; CV; and K CU; Wj. 


Proof: For each € > 0 let V; be the set of points in a Vj whose distance 
from R” \ ve is greater than e. Clearly V is open and VF Cc Vj. We claim 
that K Cc UY V;¥* if € is sufficiently seul. Otherwise, bx each € > 0 there 
exists «, € K \ ey Vi‘. Since K is compact, the x, have an accumulation 
point z € K ase +0. But then z € K \UW Vj, which is absurd. I 


(0.19) Theorem. 

Let K C IR" be compact and let Vi,...,VnN be bounded open sets such 
that KC UN V;. Then there exist functions ¢),...,¢n with ¢; € C2°(Vj) 
such that }>) ¢j = lon K. 


Proof: Let W,,...,Wwy be as in Lemma (0.18). By Theorem (0.17), 
we can choose ¢; € CO°(V;) with 0 < ¢; < 1 and 4; = 1 on W,;. Then 
G= pBAl $j; > 1 on K, so we can take ¢; = ¢;/®, with the understanding 
that ¢; = 0 wherever $; = 0. | 


The collection of functions {¢; }¥ is called a partition of unity on K 
subordinate to the covering {V;}1’. 
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D. The Fourier Transform 


In this section we give a rapid introduction to the theory of the Fourier 
transform. For a more extensive discussion, see, e.g., Strichartz [47] or 
Folland [14], [17]. 

If f € L1(R*), its Fourier transform f is a bounded function on R” 
defined by 


fl = | e282 € Fn) de. 


There is no universal agreement as to where to put the factors of 27 in the 
definition of f, and we apologize if this definition is not the one the reader 
is used to. It has the advantage of making the Fourier transform both an 
isometry on L? and an algebra homomorphism from L! (with convolution) 
to L°° (with pointwise multiplication). 

Clearly F() i is well-defined for all € and \Fllco < < || f||1. Moreover: 


(0.20) Theorem. in 
If f,g EL! then (f *g) = fg. 


Proof: This is a simple application of Fubini’s theorem: 
(sx are) = [ft se - dalw)ay de 
= // en ale) E F(x — yo 2" Eg(y) da dy 
= Fe fe oy) dy = FOTO. ' 


The Fourier transform interacts in a simple way with composition by 
translations and linear maps: 


(0.21) Proposition. 

Suppose f € L}(IR"). x 

a. If fa(x) = f(x +a) then (fal (E) = e?™** F(€). 

b. If T is an invertible linear transformation of R", then (f o TY (é) = 
| det T|- f((T-)*€). - 

c. If T is a rotation of R", then (f oT) =f oT. 


Proof: (a) and (b) are easily proved by making the substitutions 
y = 2+a and y = Tz in the integrals defining (fa) (€) and (f o TY(6), 
respectively. (c) follows from (b) since T* = T~! and |detT| = 1 when T 
is a rotation. I 
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The easiest way to develop the other basic properties of the Fourier 
transform is to consider its restriction to the Schwartz class 8. In what 
follows, if a is a multi-index, c*f denotes the function whose value at z is 


x f(z). 


(0.22) Proposition. 

Suppose fes. 

a. f €C@™ and OPf= = [(—2aiz)? ff. 
b. (8° fy = (amie)? f. 


Proof: To prove (a), just differentiate under the integral sign. To 
prove (b), write out the integral for (0° f)(€) and integrate by parts; the 
boundary terms vanish since f and its derivatives vanish at infinity. i] 


(0.23) Proposition. 
If f ES then f ES. 


Proof: By Proposition (0.22), 
OP Ef = (—1)!Fl (2aa)l4l-lal [ph ge io 


so OP E> fF is bounded for all a, 8. It then follows by the product rule for 
derivatives and induction on @ that £%08 f is bounded for all a, @, that is, 
fes. ‘ 


(0.24) The Riemann-Lebesgue Lemma. 
If f € L' then f is continuous and tends to zero at infinity. 


Proof: This is true by Proposition (0.23) if f lies in the dense sub- 
space 8 of L1. But if {fj} C 8 and f; > f in L', then fj — f uniformly 
(because ||f; — flloo < ll fj — fil), and the result follows immediately. 8 


(0.25) Theorem. 
Let f(x) = e~"#l” where a > 0. Then 


FE) = aM ME"/2, 
Proof: By making the change of variable z + a~!/?x we may assume 


a = 1. Since the exponential function converts sums into products, by 
Fubini’s theorem we have 


n 
Fe) = fn ae 1) oa dz;, 
L 
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and it suffices to show that the jth factor in the product is eG i.e., to 
prove the theorem for n = 1. Now when n = 1 we have 


[ee dz = ene? pores: dz. 


But f(z) = e7**” is an entire holomorphic function of z € C which dies out 
rapidly as | Rez| — oo when |Imz] reamins bounded. Hence by Cauchy’s- 
theorem we can shift the contour of integration from Imz = 0 toImz = —, 
which together with (0.6) yields 


ene / en TEE)? dy = gtO? ‘i e~*®" dx = e~"€. | 
(0.26) Theorem. 


If f,g € 8 then f fg =f fo. 


Proof: By Fubini’s theorem, 


[ae // F(x) g(y)eW?"7 dy dz = [is 1 


For f € L!, define the function fY by 
f‘(2) = feress@ dé = f(-z). 
(0.27) The Fourier Inversion Theorem. 
If f € 8, (f)” = f. 


Proof: Given ¢€ > 0 and x € R", set $(€) = e2"!= &-*7 El’, Then by 
Theorem (0.25), 


Bey) = frre s err ee dg = mere whl, 
Thus, 
$(y) = €-g(e7*(@ — y)) = ge(a — y) where g(x) = e7 "FI, 
By (0.26), then, 
| ae aC) df= | f= [= [ tae» dy = fxg.(z). 


By (0.6) and (0.14), f*g.— f uniformly as € — 0 since functions in § are 
uniformly continuous. But clearly, for each z, 


[eteut arnt Fe) ag s [er=*Fo dé = (f)"(z). 1 
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(0.28) Corollary. 
The Fourier transform is an isomorphism of § onto itself. 


(0.29) The Plancherel Theorem. 
The Fourier transform on 8 extends uniquely to a unitary isomorphism of 
L? onto itself. 


Proof: Since § is dense in L? (Theorem (0.16)), by Corollary (0.28) 
it suffices to show that IIflla= = |\flle for f € 8. If f € 8, set g(x) = F(—z). 
One easily checks that 9 = f. Hence by Theorems (0.20) and (0.27), 


Wik =f eFeae = F+9(0) = [Forde = f KoFea 
= WA. 


The results (0.20)-(0.29) are the fundamental properties of the Fourier 
transform which we shall use repeatedly. We shall also sometimes need the 
Fourier transform as an operator on tempered distributions, to be discussed 
in the next section, and the following result. 


(0.30) Proposition. 
If f € L! has compact support, then f extends to an entire holomorphic 
function on (’. If f € Co°, then F(©) is rapidly decaying as | Re€é| —+ co 
when |Im€] remains bounded. 

Proof: The integral f(é) = fe-?*€ f(x) dz converges for every 
€ € C’, and e~?*'*€ is an entire function of € € C*. Hence one can 
take complex derivatives of f simply by differentiating under the integral. 


Moreover, if f € Co and f is supported in {x : |z| < K}, for any multi- 
index a we have 


(amie) F(€)| = I/ en REO f(x) da! < eX Mm ella" FIl5, 


which yields the second assertion. 1 


E. Distributions 


We now outline the elements of the theory of distributions. The material 
sketched here is covered in more detail in Folland [14] and Rudin [41], and a 
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more extensive treatment at an elementary level can be found in Strichartz 
[47]. See also Treves [49] and Hormander [27, vol. I] for a deeper study of 
distributions. 

Let be an open set in R®. We begin by defining a notion of sequential 
convergence in C2°(Q). Namely, we say that ¢; > ¢ in C2(Q) if the ¢;’s 
are all supported in a common compact subset of Q and 8%; —> 0% 
uniformly for every multi-index a. (This notion of convergence comes from 
a locally convex topology on C9°(Q), whose precise description we shal! not 
need. See Rudin [41] or Treves [49].) 

If u is a linear functional on the space C'2°(Q), we denote the number 
obtained by applying u to ¢ € C°(Q) by (u, ¢) (or sometimes by (¢, u): 
it is convenient to maintain this flexibility). A distribution on Q is a 
linear functional u on C>°(Q) that is continuous in the sense that if 6; > ¢ 
in C°(Q) then (u,$;) — (u,¢). A bit of functional analysis (cf. Folland 
[14, Prop. (5.15)]) shows that this notion of continuity is equivalent to the 
following condition: for every compact set K C Q there is a constant Cx 
and an integer Nx such that for all ¢ € C2°(K), 


(0.31) Ku ASC SO [0% 


lol SNK 


The space of distributions on Q is denoted by D/(Q), and we set D! = 
D‘(IR"). We put the weak topology on D’(Q); that is, uj + u in D’(Q) if 
and only if (u;,¢) — (u, ¢) for every ¢ € C°(Q). 

Every locally integrable function u on 2 can be regarded as a dis- 
tribution by the formula (u,¢) = f ud, which accords with the notation 
introduced earlier, (The continuity follows from the Lebesgue dominated 
convergence theorem.) This correspondence is one-to-one if we regard two 
functions as the same if they are equal almost everywhere. Thus distribu- 
tions can be regarded as “generalized functions.” Indeed, we shall often 
pretend that distributions are functions and write (u,¢) as f u(x)¢(x) dz; 
this is a useful fiction that makes certain operations involving distributions 
more transparent. 

Every locally finite measure pz: on 2 defines a distribution by the formula 
(u,¢) = [{ ¢dp. In particular, if we take yz: to be the point mass at 0, we 
obtain the graddaddy of all distributions, the Dirac 6-function 6 € D! 
defined by (6,¢) = ¢(0). Theorem (0.13) implies that if u € L', fu=a, 
and u.(z) = e~"u(e~12), then u, + ad in D! when € > 0. 

Ifu,v € D'(Q), we say that u = v on an openset V C Qif (u, ¢) = (v, 6) 
for all 6 € C°(V). The support of a distribution u is the complement of 
the largest open set on which u = 0. (To see that this is well-defined, one 
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needs to know that if {Va}aea is a collection of open sets and u = 0 on 
each V,, then u=0 on UV,. But if 6 € C°(UV.), supp ¢ is covered by 
finitely many V,,’s. By means of a partition of unity on supp ¢ subordinate 
to this covering, one can write ¢ = ey ¢@; where each ¢; is supported in 
some V,. It follows that (u, 4) = )>(u, ¢;) = 0, as desired.) 

The space of distributions on R” whose support is a compact subset of 
the open set Q is denoted by &/(), and we set &’ = E€'(IR”). 

Suppose u € €'. Let 2 be a bounded open set such that supp u C 2, and 
choose » € C2°(Q) with w = 1 on a neighborhood of supp u (by Theorem 
(0.17)). Then for any ¢ € CS we have 


(u, $) = (u, vd). 


This has two consequences. First, u is of “finite order”: indeed, by (0.31) 


with K =Q, 
Ku, AIS Cay SO [O%(v4)lleo. 


lal<Ng 


Expanding 0°(~¢) by the product rule, we see that 


(0.32) tu, P< © D7 sup 18% 4(2)) 


lalgn” 


where N = Nz and C depends only on Cy and the constants {|0°p[Io0, 
|B] < N. Second, (u, ¥¢) makes sense for all 6 € C™, compactly supported 
or not, so if we define (u,¢) to be (u,¥¢) for all ¢ € C™, we have an 
extension of u to a linear functional on C™. This extension is clearly 
independent of the choice of y, and it is unique subject to the condition 
that (u, @) = 0 whenever supp ¢ and supp u are disjoint. Thus distributions 
with compact support can be regarded as linear functionals on C™ that 
satisfy estimates of the form (0.32). Conversely, the restriction to (9° 
of any linear functional on C™ satisfying (0.32) is clearly a distribution 
supported in 2. 


The general philosophy for extending operations from functions to dis- 
tributions is the following. Let T be a linear operator on C$°(Q) that is 
continuous in the sense that if ¢; > ¢ in C2°(Q) then T¢; — T¢ in CS°(Q). 
Suppose there is another such operator TJ’ such that [(T¢)y = f ¢(T") 
for all 6, Y € C°(Q). (We call 7” the dual or transpose of T.) We can 
then extend T to act on distributions by the formula 


(Tu, ¢) = (u,T’9). 
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The linear functional Tu on C°(Q) defined in this way is continuous on 
C°(Q) since TJ” is assumed continuous. The most important examples are 
the following; in all of them the verification of continuity is left as a simple 
exercise. 


1. Let T be multiplication by the function f € C~@(Q). Then JT’ = T, 
so we can multiply any distribution u by f € C(Q) by the formula 
(fu, @) = (u, fd). 

2. Let T = 6%. By integration by parts, T’ = (—1)!#!a%. Hence we 
can differentiate any distribution as often as we please to obtain other 


distributions by the formula (0%u, 6) = (—1)!#!(u, 8%). 


3. We can combine (1) and (2). Let T = }7)4)<44a0% be a differential 
operator of order k with C™ coefficients ag. Integration by parts shows 
that the dual operator T” is given by T’¢ = Dale (—1)!18%(aa9). For 
any distribution u, then, we define Tu by (Tu, ¢) = (u, T’¢). 


Clearly, if u € C*(Q), the distribution derivatives of u of order < k are 
just the pointwise derivatives. The converse is also true: 


(0.33) Proposition. 
If u € C(Q) and the distribution derivatives O%u are in C(Q) for \a| < k 
then u € C*(Q). 


Proof: By induction it suffices to assume that k = 1. Since the 
conclusion is of a local nature, moreover, it suffices to assume that Q is a 
cube, say 2 = {x : max|z; — yj| < r} for some y € R®. For z EQ, set 

Ty 
v(x) = Oyu(t,x2,...,2n)dt+ uly, Z2,-..,2n). 
ya 
It is easily checked that v and u agree as distributions on Q, hence v = u 
as functions on Q. But @,u is clearly a pointwise derivative of v. Likewise 
for Oqu,...,O,u; thus u € C1(Q). ' 


We now continue our list of operations on distributions. In all of the 
following, we take 2 = R”. 


4, Given z € R", let T¢ = ¢,, where ¢c(y) = ¢(a@ + y). Then T’¢ = ¢_z. 


Thus for any distribution u, we define its translate uz by (uz, ¢) = 
{u, 6-2). 


5. Let Td = 4, where (x) = $(—z). Then 7” = T, so for any distribution 
u we define its reflection in the origin u by (u, ) = (u, d). 
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6. Given » € CO, define Td = ¢* ¥, which is in Ce. by (0.14) and 

(0.15). It is easy to check that T’¢ = ¢* w, where y is defined as in 
(5). Thus, if u is a distribution, we can define the distribution u * y 
by (ux ¥,¢) = (u,¢* 2). On the other hand, notice that ¢ + ¥(z) = 
(¢, (¥z) ), so we can also define u*y pointwise as a continuous function 
by ux $(z) = (u, (ve)). 
In fact, these two definitions agree. To see this, let ¢ € CS, let K be 
a compact set containing supp(¥.) for all x € supp ¢, and let Nx be 
as in (0.31). From the relation ¢ * v(y) = f O(z)(vz2)(y) dz it is is not 
hard to see that there is a sequence of Riemann sums )> ¢(z;)(W2,) Az; 
that converge uniformly to gee together with their derivatives of order 
< Nx. But then (0.31) implies that if u* yp is defined as a continuous 
function, we have 


(uxy,¢) = lim) u * w(x; )¢(a;) Az; 
= lim }7(u, (de; )b(25) Aas = (u,b * 9), 


which is the action of the distribution u* ~ on ¢. 
Moreover, by (2) and integration by parts, we see that the distribution 
%(u * P) is given by 


(0%(u* p), ¢) = (—I)lel (u, (87d) + d) = (u,b * (O%V)) = (u* 07d, 4), 


so 0° (ue) = u* Oy is a continuous function. Hence u + is actually 
a C® function. 


7. The same considerations apply when u € €’ and » € C™. That is, we 


can define u* # either as a distribution by (u* ¥,¢) = (u,¢* ), or as 
a C™ function by u +(x) = (u, (2) ). 


8. If u € €’ and # E C®, as in (0.15) we see that ux ~ € Co’. Hence 
we can consider the operator Ty = u+y on C>°, whose dual is clearly 
Ty = t+ wy. It follows that if u € €’ and v € D/, u*v can be 
defined as a distribution by the formula (u + v,¥) = (v,a*y). We 
leave it as an exercise to verify that for any multi-index a we have 
O°%(u xv) = (O%u) + v = u* (0%). 

We shall also need to consider the class of “tempered distributions.” 
We endow the Schwartz class 8 with the Fréchet space topology defined by 
the family of norms ||¢|l(,6) = ||z°O°¢lloo. That is, ¢; — ¢ in § if and 
only if 

sup [2° 0°(¢; — ¢)(x)| > 0 for all a, f. 
=z 
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A tempered distribution is a continuous linear functional on §; the space 
of tempered distributions is denoted by 8’. Since C2? is a dense subspace of 
§ in the topology of S, and the topology on C2 is stronger than the topology 
on §, the restriction of every tempered distribution to Co? is a distribution, 
and this restriction map is one-to-one. Hence, every tempered distribution 
“is” a distribution. On the other hand, Proposition (0.32) shows that 
every distribution with compact support is tempered. Roughly speaking, 
the tempered distributions are those which “grow at most polynomially at. 
infinity.” For example, every polynomial is a tempered distribution, but 
u(z) = el*l is not. (Exercise: prove this.) 

One can define operations on tempered distributions as above, simply 
by replacing C'S by 8. For example, if u € 8’, then: 


1. 0%u is a tempered distribution for all multi-indices a; 


2. fu is a tempered distribution for all f € C@ such that O°%f grows at 
most polynomially at infinity for all a; 


3. uxd@ is a tempered distribution, and also a C™ function, for any ¢ € 5. 


The importance of tempered distributions lies in the fact that they 
have Fourier transforms. Indeed, since the Fourier transform maps 8 con- 
tinuously onto itself and is self-dual by Proposition (0.26), for any u € 8’ 
we can define @ € 8! by (a, ¢) = (u, ) (¢ € $8), which is consistent with 
the definition for functions. It is easy to see that Propositions (0.21) and 
(0.22) are still valid when f € 8’, as is the Fourier inversion theorem (0.27), 
provided f¥ is defined by (f”,¢) = (f,¢%). Also, if u & 8’ and ¢ € 38, we 
have (u*@) = $i; the proof is left as an easy exercise. 

For example, the Fourier transform of the Dirac 6-function is given by 
(6, ¢) = (6,6) = 6(0) = f (x) dz = (1,4), so 6 is the constant function 1. 
It then follows from the inversion theorem that 1 = 6, and from Proposition 
(0.22) that (€°%5)° = (2mi)lelé and that (a2) = (i/2n)!210%6. 


F. Compact Operators 


Let X be a Banach space and let T be a bounded linear operator on X. We 
denote the nullspace and range of T by N(T) and R(T). T is called com- 
pact if whenever {2;} is a bounded sequence in X, the sequence {T'z;} has 
a convergent subsequence. Equivalently, T is compact if it maps bounded 
sets into sets with compact closure. T' is said to be of finite rank if 
R(T) is finite-dimensional. Clearly every bounded operator of finite rank 
is compact. 
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(0.34) Theorem. 
The set of compact operators on X is a closed two-sided ideal in the algebra 
of bounded operators on X with the norm topology. 


Proof: Suppose T, and T2 are compact, and {z;} C X is bounded. 
We can choose a subsequence {y;} of {x;} such that {71 y;} converges, and 
then choose a subsequence {z;} of {y;} such that {T2z;} converges. It 
follows that 4,7, + a27T> is compact for all aj,a2 € C. Also, it is clear that 
if T is compact and S is bounded, then T'S and ST are compact. Thus the 
set of compact operators is a two-sided ideal. 

Suppose {Tj} is a sequence of compact operators converging to a limit 
T in the norm topology. Given a seqence {z;} C X with ]|xz;|| < C for all 
j, choose a subsequence {x1;} such that {T\21;} converges. Proceeding 
inductively, for m = 2,3,4,..., choose a subsequence {tm;} of {t(m—1);} 
such that {T;,2mj} converges. Setting y; = xj;, one easily sees that {Tny; } 
converges for all m. But then 

[IZ yj ~ Tyell < ML — Ton)usll + [Zoo(u5 ~ ve) + IMZen — Tol 
S 2C||T — Tm|| + I|Tmyj — Tye ll 
Given € > 0, we can choose m so large that ||[7’— Tm|| < €/4C, and then 
with this choice of m we have ||Tny; — Imyz|| < €/2 when 7 and k are 
sufficiently large. Thus {T'y;} is convergent, so T' is compact. I 


(0.35) Corollary. 
If T is a bounded operator on X and there is a sequence {T,} of operators 
of finite rank such that ||T,, — T|| + 0, then T is compact. 


In case X is a Hilbert space, this corollary has a converse. 


(0.36) Theorem. 
If T is a compact operator on a Hilbert space X, then T is the norm limit 
of operators of finite rank. 


Proof: Suppose ¢ > 0, and let B be the unit ball in X. Since T(B) 
has compact closure, it is totally bounded: there is a finite set y1,...,4n 
of elements of T(B) such that every y € T(B) satisfies ||y — y;|| < € for 
some j. Let P, be the orthogonal projection onto the space spanned by 
Yi,--+)Yn, and set T, = P.T. Then 7, is of finite rank. Also, since T,z is 
the element closest to Tr in R(P,), for zc € B we have 


Tx — T,2\|| < i —y é 
\|T'z ez|| < ymin |lT2 yll<e 


In other words, ||T’ — T.|| < €, so T; +T ase — 0. 1 
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Remark: For many years it was an open question whether Theorem 
(0.36) were true for general Banach spaces. The answer is negative even 
for some separable, reflexive Banach spaces; see Enflo [12]. 


(0.37) Theorem. 
The operator T on the Banach space X is compact if and only if the dual 
operator T* on the dual space X* is compact. 


Proof: Let B and B* be the unit balls in X and X*. Suppose T is 
compact, and let {f;} be a bounded sequence in X*. Multiplying the f;’s 
by a small constant, we may assume {f;} C B*. The functions fj :X—+C 
are equicontinuous and uniformly bounded on bounded sets, so by the 
Arzela-Ascoli theorem there is a subsequence (still denoted by { f;}) which 
converges uniformly on the compact set T(B). Thus T*f;(r) = f;(Tz) 
converges uniformly for z € B, so {T* f;} is Cauchy in the norm topology 
of X*. Hence TJ™ is compact. 

Likewise, if T* is compact then T** is compact on X**. But X is 
isometrically embedded in X**, and T' is the restriction of T** to X, so T’ 
is compact. i] 


We now present the main structure theorem for compact operators. 
This theorem was first proved by I. Fredholm (by different methods) for 
certain integral operators on L? spaces. In the abstract Hilbert space set- 
ting it is due to F. Riesz, and it was later extended to arbitrary Banach 
spaces by J. Schauder. For this reason it is sometimes called the Riesz- 
Schauder theory. We shall restrict attention to the Hilbert space case, 
which is all we shall need, and for which the proof is easier; see Rudin {41] 
for the general case. 


(0.38) Fredholm’s Theorem. 
Let T be a compact operator on a Hilbert space X with inner product (-|-). 
For each 4 € C, let 


Vi={xEX:Tr=Az}, Wa = {2 EX: T*z = dz}. 


Then: 

a. The set of \ € C for which V) # {0} is finite or countable, and in the 
latter case its only accumulation point is 0. Moreover, dim(V,) < 00 
for allXA £0. 

b. If A #0, dim(V,) = dim(W ). 

c. If} #0, R(AI —T) is closed. 
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Proof: (a) is equivalent to the following statement: For any € > 0, 
the linear span of the spaces V, with |A| > ¢ is finite-dimensional. Suppose 
to the contrary that there exist « > 0 and an infinite sequence {z;} C X of 
linearly independent elements such that Tz; = Aj2j with |A;| > € for all j. 
Since |A;| < ||7'Il, by passing to a subsequence we can assume that {;} is 
a Cauchy sequence. Let X,, be the linear span of 21,...,2m. For each m, 
choose ym € Xm with ||yml| = 1 and ym L Xm-1. Then ym = Soy emjz;j 
for some scalars cm; , SO 


mt m~-1 


A Tym = Ommtm + ¥.; cmjAjgAz 25 = Ym + > mj (Aj Am! — 1)2; 
1 1 
= Ym (mod Xp-1). 
If n < m, then, 
AT Tym — AZT yn = ym (mod Xm—1)- 
Therefore, since ym L Xm _1, the Pythagorean theorem yields 
I[\int Tum — Ax T Yn] > 1. 


But then 
1< JAR MT ym — Tyall + a? — An lllTyall, 


or 


Tyn — Tym] > Lm — [1 AmA' |T yall. 


As m,n —+ oo the second term on the right tends to zero since ||T'ynl| < ||Z'|| 
and AmAz! — 1, and the first term is bounded below by e. Thus {Tym} 
has no convergent subsequence, contradicting compactness. 

Now consider (b). Given 4 # 0, by Theorem (0.36) we can write 
T = To +7, where 7p has finite rank and ||7;[| < [A]. The operator 
AI —T, = A(I — A7!T}) is invertible (the inverse being given by the con- 
vergent geometric series }~>° \~*~17#), and we have 


(0.39) (AI-T,)71(AI-T) = (I-71)! I-T —To) = I-(AI-T1)7?%. 


Set Tz = (AI — 7,)7!Tp. Then clearly z € V) if and only if z — Toz = 0. 
On the other hand, taking the adjoint of both sides of (0.39), we have 


QI —T*)QI-—Tyf)7' = 1-73, 


26 Chapter 0 


so y = (AI—T;)~!z isin Wy if and only if s—T} xz = 0. We must therefore 
show that the equations z—T2z = 0 and z—Tyz = 0 have the same number 
of independent solutions. 

Since To has finite rank, so does Ty. Let u1,...,unw be an orthonormal 
basis for R(T2). Then for any z € X we have Toz = ae f;(z)u; where 
ey \fi(2)/? = ||To2ll?. It follows that x — f;(z) is a bounded linear 


functional on X, so there exist v1,...,un such that 


N 
Tet => (z|y)uj; (2 € X). 
1 


Set Bye = (u; | ve), and given x € X, set a; = (x|v;). If e -— Thx = 0, then 
z= S\} ajuj, and we see by taking the scalar prodict with vz that 


(0.40) az- >> Bjeay=0, k=1,...,N. 
7 


Conversely, if a,..., a, satisfy (0.40), then c = }\aj;u; satisfies s—Thx = 
0. On the other hand, one easily verifies that Tj z = S-(z|u,;)v;, so by the 
same reasoning, z — Tyz = 0 if and only if x = $7) a;ju;, where 


(0.41) ar— > Bpyaj =0, bk =1,...,N. 
j 


But the matrices (6;. — Gjx) and (6x — B,;) are adjoints of each other and 
so have the same rank. Thus (0.40) and (0.41) have the same number of 
independent solutions. 

Finally, we prove (c). Suppose we have a sequence {y;} C R(AI — T) 
which converges to an element y € X. We can write y; = (AI — T)z; 
for some z; € X; if we set x; = uj +; where uj € Va and v; 1 Vy, 
we have yj = (AI — T)v;. We claim that {v;} is a bounded sequence. 
Otherwise, by passing to a subsequence we may assume ||v;|| — oo. Set 
w; = v;/|\v;||; then by passing to another subsequence we may assume that 
{T'w;} converges to a limit z. Since the y;’s are bounded and ||v;|| —+ co, 


Aw; = Tw + — 00). 


inn 
Thus z L V) and |{z{] = |AI, but also 


(AI —T)z = lim(Q1I — T)Au; = oe Ie inl 
vj 
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so z € Vy. This is a contradiction since we assume A # 0. 
Now, since {v;} is a bounded sequence, by passing to a subsequence we 
may assume that {T'v;} converges to a limit x. But then 


v= Ay; +Tv;) > Ay +2), 
so 
y = lim(Al — T)v; = AI — T)AT (yy +2). 
Thus y € R(AI — T), and the proof is complete. i] 


(0.42) Corollary. 
Suppose \ # 0. Then: 

i. The equation (AI — T)z = y has a solution if and only if y L Wy. 
ii. AI — T is surjective if and only if it is injective. 


Proof: (i) follows from part (c) of the theorem and the fact that 
R(S) = N(S*)+ for any bounded operator S. (ii) then follows from (i) and 
part (b) of the theorem. | 


In general it may happen that the spaces V) are all trivial. (It is easy 
to construct an example from a weighted shift operator on /?.) However, if 
T is self-adjoint, there are lots of eigenvectors. 


(0.43) Lemma. 
IfT is a compact self-adjoint operator on a Hilbert space X, then either 
\|Z'|| or -||7'|] is an eigenvalue for T. 


Proof: Clearly we may assume T # 0. Let c = ||Z|| (so c > 0), and 
consider the operator A = c?J — T?. For all x € X we have 


(Az |x) = c?|lx||? ~ ||TzI|? > 0. 


Choose a sequence {xj} C X with ||z;|| = 1 and ||Tz;|]| - c. Then 
(Az; |z;) — 0, so applying the Schwarz inequality to the nonnegative 
Hermitian form (u,v) — (Au]|v), we see that 
{| Azs||? = (Aaj | Azj) < (Az, | 2j)'/?(A?2; | Ans)? 
< (Az; | 25)'/?|]A?2,{[7 |] Azyl|/? < |JAIP/?( Az; | 23)? > 0, 
so Az; — 0. By passing to a subsequence we may assume that {T'x;} 


converges to a limit y, which satisfies 


{yl} = lim [|T'z;|] = ¢ > 0, Ay = lim AT; = limT'Az; = 0. 
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In other words, 
y#O and Ay=(cl+T)(cI -T)y =0. 
Thus either Ty = cy or cy—Ty = z # 0 and Tz = —cz. | 


(0.44) The Spectral Theorem. 
IfT is a compact self-adjoint operator on a Hilbert space X, then X has an 
orthonormal basis consisting of eigenvectors for T. 


Proof: It is a simple consequence of the self-adjointness of T' that (i) 
eigenvectors for different eigenvalues are orthogonal to each other, and (ii) 
if Y is a subspace of X such that T(Y) C Y, then also T(Y+) c y+. In 
particular, let Y be the closed linear span of all the eigenvectors of T. If we 
pick an orthonormal basis for each eigenspace of J’ and take their union, 
by (i) we obtain an orthonormal basis for Y. By (ii), T|¥+ is a compact 
operator on Y“, and it has no eigenvectors since all the eigenvectors of T 
belong to Y. But this is impossible by Lemma (0.43) unless Y+ = {0}, so 
Y=X. | 


We conclude by constructing a useful class of compact operators on 
L?(y), where jp: is a o-finite measure on a space S. To simplify the argument 
a bit, we shall make the (inessential) assumption that L?(j) is separable. 
If K is a measurable function on S x S, we formally define the operator 
Tx on functions on S by 


T f(z) = [ K(e,vFu) duty). 
If K € L?(p x p), K is called a Hilbert-Schmidt kernel. 


(0.45) Theorem. 
Let K be a Hilbert-Schmidt kernel. Then Tx is a compact operator on 
L?(u), and ||Tk|| < IKll2- 


Proof: First we show that Tx is well-defined on L?(j:) and bounded 
by ||K{l2. By the Schwarz inequality, 


1/2 
IT sla) sf IK(e, VILE) dale) <| IK (eI? du(y)| lf lle- 
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This shows that 7x f is finite almost everywhere, and moreover 


WT A= f Wie F(a)? dua) < WB ff Ue, dP duty) dye) 
= WABI, 
so ||Tx|| < ||Klf2. 
Now let {¢;}9 be an orthonormal basis for L?(). It is an easy conse- 
quence of Fubini’s theorem that if ¥4;(z,y) = :(z)¢d;(y), then {Wij}? a1 


is an orthonormal basis for L?( x s). Hence we can write K = >" aijvij. 
For N =1,2,..., let 


Kn(z,y)= >> aijds(z.y)= S> aij ¢s(z)d;(y). 


i4j<N i47<N 


Then R(Tx,,) lies in the span of ¢1,...,¢N, so Tx, has finite rank. On 
the other hand, 


\|K -— Ky 


z= ye jai;|? + 0 as N +00, 
i4j>N 


so by the previous remarks, 
IZx —Tkyll < IK — Kn|l2 > 0 as N > 0. 


By Corollary (0.35), then, 7 is compact. i] 


Chapter 1 
LOCAL EXISTENCE THEORY 


In this chapter we set up the ideas and terminology with which we shall be 
working throughout the book, and we prove some basic results about local 
existence of solutions to partial differential equations. 


A. Basic Concepts 


Roughly speaking, a partial differential equation of order k is an 
equation of the form 


F(x,u,diu,...,Onu, O?u,...,dku) = 0 


relating a function u of the variable z € R®” and its derivatives of order 
< k. In order to keep the notation manageable, we introduce the following 
terminology. Let us order the set of multi-indices by saying that @ comes 
before @ if |a| < |B| or if |a| = |f| and a; < B; where 7 is the largest number 
with a; # #;. Given complex numbers ag (|a| < k), we denote by (aa)|a\<k 
the element of CY) given by ordering the a’s in this fashion, where N(k) 
is the cardinality of {a : |ja| < k}. Similarly, if S C {a : la] < k}, we can 
consider the ordered (card S)-tuple (aa)aes- 

Now let Q be an open set in R”, and let F be a function of the variables 
z € Qand (uadal<k € Cc’). Then we can form the partial differential 
equation 


(1.1) F(z, (8%u)jaj<z) = 0. 


A (complex-valued) function u = u(z) on Q is a classical solution of this 
equation if the derivatives 0%u occurring in F exist on 2, and 


F(z, (8% u(z))ja<e) = 0 forall EQ. 
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We are really being too vague. The general formula (1.1) includes ab- 
surd equations such as exp(@,u) = 0 which have no solutions for trivial 
reasons, as well as equations like (0,u)? — 4 = 0, which is really the dis- 
junction of the two equations 0,u = 2 and 0,u = —2. It also allows us to 
think of a kth order equation as a (k + m)th order equation for any m > 0. 
However, we shall not formulate precise restrictions on F until we consider 
more specific problems. 

The equation (1.1) is called linear if F is an affine-linear function of 
the vector variable (ue)joj<z, that is, if (1.1) can be rewritten as 


(1.2) > Gq(z)O%u = f(z). 


lalsk 


In this case we speak of the differential operator LD = Po lalek a,0% and 
write (1.2) simply as Lu = f. If the coefficients ag are C~® on 2 we 
can apply the operator L to any distribution u on 2, and uw is called a 
distribution solution or weak solution of (1.2) if (1.2) holds in the 
sense of distributions, namely, 


Ye (-Dlel(u, 6%(aad)) = (fd) (@ € C(O). 


Jal <k 


More general than the linear equations are the quasi-linear equa- 
tions, namely, those equations (1.1) where F is an affine-linear function of 
(ua)jo|=- Such equations can be written as 


(1.3) Do Gaz, (F u)ip.ce—1)d%u = b(x, (8° u)jajce—1)- 
le|=k 


The general questions with which we shall be concerned are the follow- 
ing: 


Existence. Can we find a solution u of (1.1), perhaps satisfying some 
preassigned conditions, in some neighborhood of a given point, or in some 
given domain? How can we construct solutions? 


Uniqueness. In general (1.1) will have many solutions. What kinds 
of boundary conditions on u will guarantee uniqueness? 


Behavior of solutions. What are the qualitative properties of solu- 
tions — in particular, differentiability properties? Do the solutions depend 
smoothly on the boundary conditions? In the linear case (1.2), does u 
depend smoothly on f? 
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The answers to these questions will depend on the nature of the con- 
straints imposed on the solution u by the equation (1.1), and these in turn 
depend very strongly on the nature of the equation under consideration, 
Let us look at some simple examples in R?: 


i. The general solution of the equation 0;u = 0 is clearly u(zy,22) = 
f(2) where f is arbitrary. Thus the equation 8,u = 0 gives complete 
information about the behavior of u as a function of x; (it must be 
constant), but none at all about its dependence on 2x9. 


ii. The general solution of 0,82.u = 0 is easily seen to be u(z,,22) = 
f(21)+9(x2) where f and g are arbitrary. Thus the equation 0,02u = 0 
gives no information about the dependence of u on either zy or x2 
when the other is held fixed, but merely says that the dependences are 
“uncoupled” in a certain sense. 


iii. Any solution of the Cauchy-Riemann equation 0,u + idgu = 0 isa 
holomorphic function of the complex variable z = 2x, + tz2, and in 
particular is C®. This equation imposes very strong restrictions on all 
the derivatives of the solution u. 


In the linear case, a simple measure of the “strength” of a differential 
operator in a certain direction is provided by the notion of characteristics. 
IfL= Dlalsk a,9* is a linear differential operator of order k on QC R”, 
its characteristic form at z € ( is the homogeneous polynomial of degree 
k on R® defined by 


X(zE)= D> aa(ze*  (€ ER”). 


laj=k 


A nonzero vector € is called characteristic for D at z if yz(«,é) = 0, and 
the set of all such € is called the characteristic variety of L at x and is 
denoted by char,(Z): 


char,(D) = {é #0: xz(z,4)= 0}. 


To see more clearly the meaning of the characteristic variety we must 
consider the effect of a change of coordinates. Let F be a smooth one-to- 
one mapping of Q onto some open set 1’, and set y = F(z). Assume that 
the Jacobian matrix J, = [(Oy;/z;)(z)] is nonsingular for z € Q, so that 
Yi,.++)Yn is a coordinate system on 2. We have 
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which we can write symbolically as 0, = Jid,, where J! is the transpose 
of J,. The operator E is then transformed into the operator 


Li = Yo aa(F7*(y))(Tp-1(y)y)* 
lal<e 


on 9’. When this is expanded out, the expression will involve derivatives 
of JP-1(y)i but these derivatives are only formed by “using up” some of 
the dy’s on Tp-iy)> so they do not enter into the highest order terms. One 
then sees that 


xxr(y, €) = s aq(F~*(y))(Jp-1(y)€)*- 


fal=k 


Since F—!(y) = z, on comparing with the expression 


xe(2,€) = S> aa(2)é* 


Jaf=k 


we see that char,(Z) is just the image of chary(L’) under the linear map 
Je. 

Another way of looking at this is the following. If we associate to 
€ € R" the differential form dé = )7} €; dy; and use the chain rule dy; = 
(Oy; /Ox;) dz;, we have 


Oy; 
Thus in the z-coordinates, df corresponds to the vector Ji€. In the termi- 
nology of differential geometry, char, (Z) is intrinsically defined as a subset 
of the cotangent space at z. 

Now, note that if € # 0 is a vector in the z-direction (i.e., & = 0 for 
i # j), then € € char, (L) if and only if the coefficient of oF in D vanishes at 
z. Moreover, given any € # 0, by a rotation of coordinates we can arrange 
for € to lie in a coordinate direction. Thus the condition € € char,(L) 
means that, in some sense, LD fails to be “genuinely kth order” in the € 
direction at z. 

L is said to be elliptic at x if char,(Z) = © and elliptic on 0 if it is 
elliptic at every c € Q. At least on the formal level, an elliptic operator of 
order k exerts control on all derivatives of order k. We shall see in Chapter 
6 that this formal statement is also valid analytically. 

Here are some examples. The first three are the ones discussed above, 
and the second three are the basic operators of mathematical physics which 
will be studied in detail in Chapters 2-5. 
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i. LD =: char,(L) = {€ $0: & =O}. 
ii. L = 0,09: char,(L) = {€ £0: &, =0 or & =O}. 


iii. L = 3(0, + 282) (the Cauchy-Riemann operator* on R?): L is elliptic 
on R2. 


iv. L = So} 0? (the Laplace operator): L is elliptic on R”. 

v. L = 8-32 4? (the heat operator): char,(L) = {€ # 0: €; = 0 for j > 
2}. 

vi. L = Of— 57} O? (the wave operator): char,(L) = {€ # 0: €? = So} €?}. 


One last definition. A hypersurface S is called characteristic for L at 
z € S if the normal vector v(z) to S at x is in char,(L), and S is called 
non-characteristic if it is not characteristic at any point. We remark that 
v(x) can be defined in a coordinate-free way as a cotangent vector at zx, 
ie., it transforms under coordinate changes by the same rule as char,(L). 
(In fact, v(x) is one of the two unit cotangent vectors at x that annihilate 
the tangent space to S at z; the choice of orientation is immaterial since 
xi(z,—€) = (-1)¥ xx(z,€).) Hence the condition “S is non-characteristic” 
is independent of the choice of coordinates. 

The notion of characteristics can be extended to the nonlinear theory, 
but there the situation is more complicated, as we shall see. 


B. Real First Order Equations 


Recall that the basic boundary value problem for first order ordinary dif- 
ferential equations is the initial value problem: given a function F on R% 
and (to, uo) € R2, find a function u(t) defined in a neighborhood of to such 
that F(t, u,u’) = 0 and u(to) = up. In this section we shall consider the 
corresponding initial value problem for real first order partial differential 
equations. Namely, given an equation F(z, (8°u)jaj<1) = 0 (where F and 
u are assumed to be real-valued), a hypersurface S, and a (real-valued) 
function ¢ on S, find asolution u defined on a neighborhood of § such that 
u=¢onS. This theory has a pleasantly geometric flavor. 


* Question: Why the factor of i? Answer: The operators 0, = $(O2 - idy ) and 
Oo; = 5 (de + idy) are the result of formally applying the chain rule to the change of 
variables z = £ + iy, Z = z— iy. When applied to holomorphic functions, 0, is just 


the complex derivative. 
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Let us first consider the linear equation 
(1.4) Lu= Sa; dju+ bu = f, 


where aj, 6, and f are assumed to be C! functions of z. If we denote by A 
the vector field (@1,...,@n), we have 


char,(L) = {€ £0: A(z) -€ = 0}, 


that is, char,(L) U {0} is the hyperplane orthogonal to A(x). From this it 
is clear that a hypersurface S is characteristic at x if and only if A(z) is 
tangent to S at x. 

Suppose we wish to find a solution u of (1.4) with given initial values 
u = ¢ on the hypersurface S. If S is characteristic at xo, the quantity 
>> a; (x0)0;u(zo) is completely determined as a certain directional deriva- 
tive of @ along S at zo, and it may be impossible to make it equal to 
f (20) — b(zo)u(zo). (For example, if the equation is 0:u = 0 and S is 
the hyperplane rz, = 0, we cannot have u = ¢ on S unless 0,¢ = 0.) To 
make the initial value problem well-behaved, then, we must asume that S 
is non-characteristic, and we do so henceforth. 

It is natural to look at the integral curves of the vector field A (some- 
times called the characteristic curves of the equation (1.4)), that is, the 
parametrized curves x(t) that satisfy the system of ordinary differential 
equations 
(1.5) = = Ae), ie., “i = a;(2) (j =1,...,n). 


Along one of those curves a solution u of (1.4) must satisfy 


du dx; 
(1.6) HT Gap = Va dju= f bu. 


By the fundamental existence and uniqueness theorem for ordinary differen- 
tial equations (see, e.g., Coddington and Levinson [9]), through each point 
Zo of S there passes a unique integral curve z(t) of A, namely the solution 
of (1.5) with 2(0) = zo. Along this curve the solution u of (1.4) must be 
the solution of the ordinary differential equation (1.6) with u(0) = ¢(zo). 
Moreover, since A is non-characteristic, z(t) ¢ S for t # 0, at least for 
lt] small, and the curves x(t) fill out a neighborhood of S. Thus the ini- 
tial value problem for (1.4) is reduced to the initial value problem for the 
ordinary differential equations (1.5) and (1.6), and we have: 
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(1.7) Theorem. 

Assume that S is a hypersurface of class C} which is non-characteristic 
for (1.4), and that the functions a;, b, f, and ¢ are C} and real-valued. 
Then for any sufficiently small neighborhood Q of S in IR" there is a unique 
solution u € C! of (1.4) on Q that satisfies u = ¢ on S. 


This theorem is a special case of the corresponding result for quasi- 
linear equations, which we shall prove in detail below. 

What happens if we allow the quantities in (1.4) to be complex-valued? 
If we set aj = a} +iaj, and similarly for 6, f, and u, the real and imaginary 
parts of (1.4) are 


So ajdjul — So afdju? + bul — bu? = fi, 
So afajul + So ajaju? + bul + blu? = f?. 


This is a system of two equations for the two real unknowns u! and w?. If 
the a;’s and b are real, this system is uncoupled: the first equation involves 
only u! and the second only u?, so we can solve them separately and obtain 
the solution u = u! + iu? of (1.4), However, if the a;’s and b are complex, 
this system may possess no solutions, as we shall see in §1E. 

Now let us generalize to the quasi-linear equation 


(1.8) Y- aj(2, u)dju = b(z, u). 


If u is a function of x, the normal to the graph of u in R"t? is proportional 
to (Oiu,...,O,u, —1), so (1.8) just says that the vector field 


A(z, y) = (ai(z,y),.--,@n(x, y), b(z,y)) 


is tangent to the graph y = u(z) at any point. This suggests that we look 
at the integral curves of the vector field A in R®+! given by solving the 
ordinary differential equations 

dz; dy 

GED dt = b(z, y). 
It is clear that any graph y = u(r) in R"+! which is the union of an 
(n — 1)-parameter family of these integral curves will define a solution of 
(1.8). Conversely, suppose u is a solution of (1.8). If we solve the equations 


dz; a 2 
“y= aile ule), -2j(0) = (20); 
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to obtain a curve x(t) passing through zo, and then set y(t) = u(2z(t)), we 


have F 
> ey u— = => a,(z, u)O;u = b(z, u). 


Thus if the ek y = u(x) intersects an integral curve of A in one point 
(zo, u(xo)), it contains the whole curve. 

Suppose we are given initial data u = @ on a hypersurface S. If we 
form the submanifold 


S* = {(z,9(z)):2 €S} 


of R"+!, the graph of the solution should be the hypersurface generated 
by the integral curves of A passing through S*. Again, we need to assume 
that S is non-characteristic in some sense. This is more complicated than 
in the linear case because the coefficients a; depend on u as well as a, but 
the geometric interpretation is exactly the same: for z € S, the vector 


(ai(a, O(x)),..-,4n(z, ¢(x))) 


should not be tangent to S at z. (Note that this condition involves ¢ as 
well as S.) If S is represented parametrically by a mapping g : R"=! — R" 


and we take coordinates s),...,5,—1 on R"~}, this condition is just 
091/051 Hine 091/OSn-1 a;(g(s), 6(g(s)) ) 

(1.9) det #0. 
Ogn/O8, +++ Ogn/Osn-1 an(g(s), 6(9(s)) ) 


(1.10) Theorem. 
Suppose S is a hypersurface of class C! in IR" and a;, b, and ¢ are C! 
real-valued functions. Suppose also that the vector 


(a1(z, (2), -.-s4n(z, 6(2))) 


is not tangent to S at any c € S. Then for any sufficiently small neigh- 
borhood Q of S in R” there is a unique solution u € C? of (1.8) on Q that 
satisfies u= ¢ on S. 


Proof: The uniqueness follows from the preceding discussion: the 
graph of u must be the union of the integral curves of A in Q passing 
through S*. (Q must be taken small enough so that these curves are all 
distinct. That is, it can happen that an integral curve of A intersects S* 
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in more than one point, but in between these points of intersection it must 
leave 2.) Now, any hypersurface S can be covered by open sets on which 
it admits a parametric representation z = g(s). If we solve the problem 
on each such set, by uniqueness the local solutions agree on their common 
domains and hence patch together to give a solution for all of S. It therefore 
suffices to assume that S is represented parametrically by 2 = g(s). 

For each s € R®~!, consider the initial value problem 


Fi(s,t)=aj(2,y),  $4(5,t) = 0(2,y), 
2;(s,0)=9;(s), —_-y(s,0) = 4(9(s)). 


Here s is just a parameter, so we have a system of ordinary differential 
equations in t. By the fundamental theorem of ordinary differential equa- 
tions (Coddington and Levinson [9]), there is a unique solution (zx, y) de- 
fined for small ¢, and (z,y) is a C! function of s and ¢ jointly. By the 
non-characteristic condition (1.9) and the inverse mapping theorem, the 
mapping (s,t) + 2(s,t) is invertible on a neighborhood Q of S, yielding s 
and ¢ as C! functions of z on 2 such that t(x) = 0 and g(s(x)) = 2 when 
z €S. Now set 


u(z) = y(s(z), t(z)). 


Clearly u = ¢ on S, and we claim that u satisfies (1.8). Indeed, by the 
chain rule, 


yaa = Ou O54 Ou At 
244 = 245\ 24 O55 Fz; * Bt 83; 


j= j=l 
n-1 n 
Ou Os, Ou ot 
= _ a +— 
n—-l n 
- Ou Os, Oxy | Ou Ox; Ot 
2 Fs, 2 oe; at +a at Oz; 
= j=l 
= 57 Ou Ose, Oude 
Parr dsp Ot | Ot Ot 
Ou 
=0+ a = b, 


since s, and t are functionally independent. This completes the proof. 


Let us see a couple of examples; others will be found in the exercises. 
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Example 1. 
In R3, solve x,0,u + 22202u + O3u = 3u with u = (21,22) on the plane 
w3> 0. 


Solution: We solve the ordinary differential equations 


dey _ dz2 os des _ au sy 
dt = 71, “a 2s — i, = 


dt 
with initial conditions 
(x1, 22, £3, U)|t=0 = (51, 52, 0, $(s1, 52)), 
obtaining 
£1 = sje, ro = se", t3=t, u = 4(51, s2)e*. 
We solve the first three equations for s1, s2, and t, obtaining 


t= 23, S$, =z e"7°, Sg = T2e7 


2r3 
The solution is then 
u= ¢(z1e"7°, zoe???) e373, 
Example 2. 
In R?, solve ud,u + O.u = 1 with u = ds on the segment 21 = 22 = 5, 
O<s<l. 


Solution: First, (1.9) is satisfied, for 


O2,/8s ay(s,s,4s)\ _ 1 ds\_ 1 
act (Geis ax(s,s, ts) = det 1 4 =1l— 58s#0for0<s<1l. 
We solve the equations 

dz, 7 dzr2 du 

dt ~ cs 
with initial conditions 

(x1 »22, u)li=0 = (s, s, is), 
obtaining 
u=t+4s, t2=t+5, z= it? +hst+s. 

Since r2— 21 = 4é(2—t—s) = $t(2— 22), we can easily eliminate s and ¢ 
from these equations to obtain 


_ 4a2 — 201 — 23 
2(2 = Z2) 
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The initial value problem for a general real first order equation can also 
be reduced to the initial value problem for a system of ordinary differential 
equations. We shall derive these equations by assuming that a solution 
exists and working backwards. 

Suppose, then, that we are given a real-valued function F of the 2n+1 
variables 21,...,2n,Y,21,)---,;2n- (The appropriate smoothness assump- 
tion here is F € C?.) Suppose that we have a function u of z such that 
when we set y = u(x) and z; = 0;u(z), the equation 


(1.11) F(2\,...,2n,Y,21,+--)2n) =0 
is satisfied identically in z, that is, u is a solution of 
(1.12) F(z1,...,2n,u, Oyu, ...,Onu) = 0. 


In what follows we stipulate that the function F and its derivatives are to 
be evaluated at y = u(z), 2; = Oju(zx). 

First, consider the integral curves of the vector field V,F, i.e., the 
solution curves of the equations 


dr; OF 
(1.13) = dae 
Along these curves the quantity y must satisfy 
dy _ Ou dzj _ OF 


In the quasi-linear case (1.8), OF /9z; = a; and )>z;(OF/0z;) = 6 are 
independent of the z;’s, so (1.13) and (1.14) form a determined system 
in the variables 21,...,2n,y — in fact, the system in used in the proof 
of Theorem (1.10). For the general case we also need equations for the 
variables z;, which can be obtained as follows. From (1.13), we have 


oA Oz; dz, Ou Oz, OF 
= Ox, dt = ae a Ox; Oz, a = ae Ox; Oz,_ 
Also, differentiating (1.11) with respect to z;, 


_ OF pid Oy OF Oz, _ oF OF Oz, OF 
~ Oz; * ‘Oy Oz, ap Deon Oz, 02; Oz; de; * 73 Oy +2 Ox; Oz," 
Combining these equations, we have 


dz; 
(1.15) oii OF OF 
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Dropping now the assumption that a solution u is given, we see that (1.13), 
(1.14), and (1.15) form a system of 2n + 1 ordinary differential equations 
for the 2n + 1 unknowns g,, y, and z;. 

As for the initial conditions, we assume given a C! hypersurface S 
represented parametrically as s = g(s) and a C function ¢ on S. Then 
the initial conditions for (1.13) and (1.14) are 


(1.16) x(s,0)=g(s), ys, 0) = 4(9(s)). 


Moreover, on S we must have 


dy _ 94(9(s)) dy Sgn _ Ogx 
(1.17) 08; Os; ao as; = ae, (l<j<n-1), 


F(g(s), $(9(s)), z3(g9(s))) = 0. 


These provide n equations in the n unknowns z1,..., Zn on S. In the quasi- 
linear case these equations are linear, and the non-characteristic condition 
(1.9) ensures the existence of a unique solution. In general we have neither 
existence nor uniqueness (for example, a square root may appear). We 


must therefore assume given C' solutions z1,...,2n of (1.17) on S which 
satisfy the non-characteristic condition 
goo Pe gS 3, Ol), (as). 45 (9(s))) 
det}: : #0. 
Be ee SE) #(9(8)), z;(9(s))) 


We can then proceed as before. We solve the system (1.13-15) with initial 
conditions (1.16-17) to obtain 2;, y, z; as functions of s and ¢. The non- 
characteristic condition ensures that the mapping (s,t) — 2(s,¢) can be 
inverted to yield s and ¢ as C functions of z. We then claim that u(z) = 
y(s(x),t(x)) satisfies (1.12): it clearly satisfies u = ¢ on S because of 
(1.16). To prove the claim, one uses the uniqueness theorem for ordinary 
differential equations and the chain rule to verify that 


F(x1,...,2n,u(z), 21(s(z), t(z)),..., 2n(s(z), t(2))) = 0, 
z;(s(z), t(z)) = dju(z). 


There is much more to be said about fully nonlinear first order equa- 
tions. More complete discussions, including geometric interpretations and 
some applications, can be found in Courant and Hilbert [10, vol. I] and 
John [30]. 
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EXERCISES 


The following problems deal with differential equations on R?. We write 
z,y and 0,, dy rather than 21,22 and 0,,, Oz,. 


1. Solve Osu + Oyu = u with u = cosz when y = 0. (Answer: u = 
eY cos(z — y).) 


2. Solve c70,u + y?Oyu = u? with u = 1 when y = 2z. (Answer: u = 
zy/(zy —y + 2z).) 


3. Show by geometric considerations that the general solution of 20,u — 
yOyu = 0is u= f(zy). Find the solution whose graph contains the line 
u=az=y. (Answer: u = \/fy.) What happens to the initial value 
problem when S is the curve y = 2~1? 


4, Solve ud,u + dyu = 1 with u = 0 when y = z. (Answer: u = 1 — 
(1 + 2z — 2y)/?,) Something nasty happens if we replace the initial 
condition u = 0 by u= 1. What? 


C. The General Cauchy Problem 
We now return to the general kth order equation 
(1.18) F(z, (8% u)jal<k) =0, 


where F will always be assumed to be (at least) C!. The most natural 
generalization of the initial value problem for first order equations, which 
corresponds to the initial value problem for higher order ordinary differen- 
tial equations, is the Cauchy problem. 

Let S be a hypersurface of class C*. If u is a C*~} function defined 
near S, the quantities u, O,u, .. . OF-ly on S are called the Cauchy data 
of uon S. (Recall that 0, is the normal derivative on [a neighborhood of] 
S: see (0.3).) The Cauchy problem is to solve (1.18) when the Cauchy 
data of u on S are preassigned. 

In this section all our considerations will be restricted to a neighbor- 
hood of a given point on S. We may therefore assume that a change of 
coordinates has been made so that S contains the origin and, near the ori- 
gin, coincides with the hyperplane z, = 0. With this in mind, it will be 
convenient to make a slight change of notation. We shall consider R® as 
IR"! x R and denote the coordinates by (x,t) where x = (z1,...,2n-14). 
Derivatives with respect to the z variables will be denoted by 0,, or OF, 
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where a@ = (aj,...,@n-1), and derivatives with respect to ¢ will be denoted 
by 0]. We can then restate the Cauchy problem as follows: Given functions 
go, .-+, k-1 Of x, solve 


(i 19) F(z,t, (aga U)lal+i<k) > 0, 
GHu(z,0)=9j(c) (SF <4). 

We observe to begin with that if u is a function of class C’ with r > k, 
then the Cauchy data {¢;} determine all derivatives 070) u on S with j < k 
and |a|+j <r; in fact, 


02 Of u(x, 0) = 974; (2). 


Hence the only quantity in the differential equation in (1.19) which is un- 
known on S is O*u. In order for the Cauchy problem to be well-behaved, 
we must assume that the equation F = 0 can be solved for OF u. 

In the linear case, 


F(a,t, (82 3 u)jalsi<k) = 1% Gq; (x, t)dZ0;u — f(z,t), 
laltj<k 


this assumption just means that S is non-characteristic. Indeed, “S is non- 
characteristic” means that aox(z,0) # 0, hence by continuity aoz(z,t) # 0 
for small ¢, and we can solve for OFu: 


ous (a04)"*| SS aaj O28 u ~ i] 


lalssk, i<k 


Here are some examples of the bad behavior that can occur when this 
condition is not satisfied: 


i. The line t = 0 is characteristic for the equation 0,0;u = 0 in R?. If u is 
a solution of this equation with u(z,0) = ¢o(z) and O,u(z, 0) = ¢1(z), 
we must have 02¢1 = 0, i.e., d; is constant. Thus the Cauchy problem 
is not solvable in general. On the other hand, if ¢; is constant, then 
there is no uniqueness: we can take u(x,t) = ¢o{z) + f(t) where f is 
any function with f’(0) = ¢1. 


ii. The line t = 0 is characteristic for the equation 02u — 0,u = 0 in R?. 
Here if we are given that u is a solution with u(z,0) = ¢o({z), then 
Q,u(z,0) is already completely determined: 0;u(z,0) = ¢9(z). 
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In the quasi-linear case, 
F(z,t, (OSHu)ojssse) = Yl day (x,t, (OF u)pisicn-1) 050 u 
jaltj=k 


— b(2,t, (2 Ifu)pisece-1) 


we say that the Cauchy problem (1.19) is non-characteristic if 


ak (x, 0, (82 4;(x))\sj4ich—1) #0 


for all x; again, this allows us to solve for the derivative OF u. 
In the general case, the equation 


F (2,0, (026;(2))jaltick, j<k Yor) = 0 


will usually not determine uo, uniquely as a function of z on S. Therefore, 
we phrase the non-characteristic condition as follows: The quantity uo, can 
be determined as a C! function of z on S so that 


F(z,0, (82 6;(2))jal4j<k, j<k» Uor(Z)) = 0, 


OF 
Duce” 0, (874; (2) alain, j<k Uor(2)) #0 


for all x. In this case, we can solve the equation F = 0 for uo, as a 
C} function G of the remaining variables near S, by the implicit function 
theorem, and write the differential equation in the normal form: 


(1.20) Ofu = G(x, t, (O28 u)alassk, i<k)- 


The Cauchy data {¢;} together with (1.20) determine all derivatives 
of u of order < k on S. If G is sufficiently smooth, we can also determine 
higher derivatives of u. Namely, differentiating (1.20) with respect to ¢, 


abu = SE 4 3 ac. eatty, 


laltisk,j<k 


All the quantities on the right are known on S, so 0! +1u is also; hence we 


know all derivatives of u of order < k+10n S. Applying 0; more times, 
we obtain higher derivatives. In particular, we have: 


(1.21) Proposition. 

Suppose that G, $0, ...,¢z-1 are analytic functions. Then there is at most 
one analytic function u satisfying (1.20) such that 0] u(z,0) = 4;({z) for 
O<y<k. 
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Proof: By Taylor’s formula, an analytic function is completely deter- 
mined by the values of its derivatives at one point. t 


Actually, the smoothness of G is unnecessary for the unique determi- 
nation of the derivatives of u on S. 


(1.22) Proposition. 
Suppose G is continuous and there is a constant C > 0 such that for all 
z,t € R” and all vectors (ua;), (vaj) (0O< Jaol+j7<k, 7 <b), 


|G(z, t, (ues) — G(x, t, (va3))] $ C > [taj — vas. 
aj 


If u and v are two solutions of (1.20) with the same Cauchy data on S, and 
the derivatives 020) u and O20} v exist for ja| < q andj <r (q,r > k), 
then these derivatives agree on S. 


Proof: Let w = u—v. It suffices to show that 0%w = 0 on S for 
m <r, as then the z-derivatives of these functions also vanish on S. We 
proceed by induction on m, the cases m < k being true by assumption. 
Suppose then that m > k and dfw = 0 on S for i < m. By Taylor’s 
theorem, 


ym-k = 
(1.23) OF w(az,t) = joe es o(t™~*) (t +0), 
and for j < k and {al+ 7 < k, 
j end aam -j ma 
875; w(x, t) = apron w(x,0) + o(t™-3) = o(t™-*) (t+ 0). 
(See Folland [16] for the form of Taylor’s theorem used here, which is omit- 
ted from most calculus books.) Therefore, by the assumption on G, 


|akw(z,t)| = |G(z,t, (OFA u(x,t) — G(z,t, (AZaf v(x, t)))| 
<C> 028 u(x,t) — 028 v(z, t)| 
=C) 02H w(z,t)| = oft-*) (t+ 0). 
Hence, by (1.23), 
pn-k 
(m—k)! 
which forces 07° w(z,0) to be 0. i] 


OP w(z,0) = o(t™—*) (t — 0), 
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Although the Cauchy problem is good from the point of view of de- 
termining a unique solution, existence is another matter, especially if we 
want a solution in a specified domain and not just in some neighborhood 
of the initial hypersurface S. In fact, the Cauchy problem tends to be 
overdetermined except in certain special situations. As we shall see, the 
appropriate boundary conditions for a differential equation depend strongly 
on the particular form of the equation. 


D. The Cauchy-Kowalevski Theorem 
In this section we consider the Cauchy problem 
F(z,(0%u)jaick) =0, BHu=gjonS (0<j<h), 


where the functions F, @o,...,@%—1 and the hypersurface S are assumed to 
be analytic, and we look for solutions defined in a neighborhood of some 
point zo € S. As in §1C, we can make an analytic change of coordinates 
from R® to R"~! x R so that xo is mapped to (0,0) and a neighborhood 
of zo in S is mapped into the hyperplane ¢t = 0. This transformation will 
of course change the function F, but the result will still be analytic. We 
assume the non-characteristic condition in its analytic form, that is, that 
the equation F = 0 can be solved for OFu to yield OFu as an analytic 
function G of the remaining variables. The Cauchy problem then takes the 
form 


(1.24) du = G(z,t, (IFA upassce, ick)» 
Hu(z,0)=4;(z) (OS 5 <4). 


Our main result is the following fundamental existence theorem. 


(1.25) The Cauchy-Kowalevski* Theorem. 

If G, ¢0,-..,¢k-1 are analytic near the origin, there is a neighborhood 
of the origin on which the Cauchy problem (1.24) has a unique analytic 
solution. 


* The problem of how to spell this name is vexed not only by the usual lack of a 
canonical scheme for transliterating from the Cyrillic alphabet to the Latin one but also 
by the question of whether to use the feminine ending (-skaia instead of -ski). The 


spelling used here is the one preferred by Kowalevski herself in her scientific works. 
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We remark that G, ¢;, and u can be complex-valued here, or, for that 
matter, vector-valued: the arguments in this section work equally well for 
determined systems of equations. Before proceeding to the proof, we recall 
some properties of power series in several variables. (The proofs are the 
same as in the one-variable case.) 


(1.26) If f is analytic near 2° € R”, there is a cube Q = {z: |zj — 23] < 
r for 1 <j <n} on which the Taylor series 


0° f(z°) O\a 
ye yp (@—2) 
z a! 
converges to f(x). The convergence is absolute and uniform on compact 
subsets of 2, and the series can be differentiated termwise. 


(1.27) Let f(z) = )> aq(z—2°)® be convergent near x = 2° € R”. More- 
over, let z be an analytic function of & € R™: x = D> bg(é — €°)? where 
bg € R" and x(€°) = bo = x°. Then the composite function F(€) = f(x(€)) 
is analytic at €°, and its power series expansion about €° is obtained by 
substituting p49 ba(€ — €°)* for z — 2° in the series )> g(x ~ 2°)® and 
multiplying out. Thus F(é) = S>cy(€ — €°)? where c, = P,(aq’s, bg’s) 
and P, is a universal polynomial (independent of the particular series in- 
volved) in those coefficients ag and bg for which a; < 7; and @j < 7; for 
all j7. Moreover, P, has nonnegative coefficients since only addition and 
multiplication are involved in expanding out the substitution. 


(1.28) Given M,r > 0, the function 


Mr 


IG) ahaa) 


is analytic in the rectangle {x : max|z;| < r/n}. By the geometric series 
formula and the multinomial theorem, its Taylor series is 


oes — (a1 +---+2n)* _ la! a 


which converges absolutely for max|z;| < r/n. 


(1.29) A power series )> a,(z—2°)* with nonnegative coefficients is said 
to majorize another series )- ba(z--°)® if aa > [ba| for all a. In this case 
it is clear that > b,(z—2x°)* converges absolutely whenever )~a4(z—z°)* 
does. 
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(1.30) Suppose }> agzx° is convergent in a rectangle {z : max|z;| < R}. 
Then a geometric series as in (1.28) can be found that majorizes )~a,z*. 
Indeed, let us fix r withO <r < R. Setting z = (r,r,...,r), we see that 
Yi agrl@l converges, so there is a constant M such that jagr!@l| < M for 
all a. Hence < Mla! 
a: 
|aa| < ylel < atrial . 

We now return to the Cauchy-Kowalevski theorem. We have proved 
uniqueness of the solution in Proposition (1.21), and the discussion there 
suggests how to prove existence: determine all the derivatives of u at the 
origin by differentiating (1.24) and plug the results into Taylor’s formula. 
The problem is to show that the resulting power series converges. To this 
end, it is technically convenient to replace our kth order equation by a first 
order system. 


(1.31) Theorem. 
The Cauchy problem (1.24) is equivalent to the Cauchy problem for a 
certain first order quasi-linear system, 


n-i 


= j t BS ,t,Y), 
(1.32) aY LAile, ,Y)de,¥ + Bla, t,Y) 


Y(z,0) = &(z), 


in the sense that a solution to one problem can be read off from a solution 
to the other. Here Y, B, and ® are vector-valued functions, the Aj’s are 
matrix-valued functions, and A;, B, and ® are explicitly determined by 
the functions in (1.24). 


Proof: The vector Y is to have components (yaj), 0 < laj+ 7 < k. 
In what follows it is understood that a20u is to be replaced by yo; as an 
independent variable in G. Also, if a@ is a nonzero multi-index, 7 = i(a) 
will denote the smallest number such that a; 4 0, and 1; will denote the 
multi-index with 1 in the ith place and 0 elsewhere. The first order system 
to be solved is 


(a) OYaj = Ya(j+1) (ja| +g9< k), 
(b) OYoy _ Or, Ya—1;)G4+1) (la| +j _ k, j < k), 


dG oG 
1.33 (c) Oryor = = + y Fo Yast) 
(1.33) at ies OYaj 


0G 
+ > Bua TiMe- G+) 
lol4j=k, j<k 7% 
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and the initial conditions are 


(a) yaj(z,0)= OF ¢j(z) (J <k), 


1.34 
a (b) you (2,0) = G(z, 0, (824;(2))al4ice, p<k)- 


It is clear that if u is a solution of (1.24), then the functions y,; = az! u 
satisfy (1.33) and (1.34). Conversely, if the y.j’s satisfy (1.33) and (1.34), 
we claim that u = yoo satisfies (1.24). This is nontrivial and involves the 
initial conditions in an essential way. 

First, (1.33a) implies that 


(1.35) Yolj+t) = Oyas (J +E<k), 
and (1.33b) then implies that for |o|+j = k andj <k, 
Oya; = 1Oe:Yia-14)j3 
so that 
Yoj (2, t) = As, Ya~t,)j (tt) + Cas (z) 
for some function cg;. But by (1.34a), 
Yoj(#,0) = O29; (2) = O2,02- "6; (x) = O,y(a—1,);(2, 0), 
so that ca; = 0, and we have 
(1.36) Yoj = O2,Ye-1)3 (lalt+j=h, 5 < k). 
Next, by (1.33c), (1.35), and (1.36), 
aG 9G Byaj 


Oryork = => + 
ae lal4j<k, i<k AYaj ot 


= S[G(e,4, (vas))]; 


so that 
yor(z, t) = G(z,t, (Ya;(z,t))) + cor(z) 


for some function cox. But by (1.34), 
yor(z,0) = G(x, 0, (87¢;(2))) = G(z,0, (yas(2,0))), 
so again co, = 0, and we have 


(1.37) yor = G(z,t, (Yas )ialtick, i<k): 
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Finally, we propose to show that 


(1.38) Yoj = 92; Ya-1,)j (a # 0). 


The proof proceeds by induction on k — j — Ja}, the initial step k = j + |a| 
being established in (1.36). By (1.38), (1.35), and the inductive hypothe- 
sis, 
OVasi = Yaj+1) = 92, Ya-1,5+1) = 89s, Ya-1,)i 
so that 
Yor (2, t) = Or, Y(a-1,)5 (zt) + Cas (2). 
But by (1.34a), 


Yo (x, 0) = O26;(x) = Op,02~ "6; (2) = Oe, Ha—1,); (2, 9). 


Hence ca; = 0, and (1.38) is established. 
Now we are done. Applying (1.35) and (1.38) repeatedly, we find that 


(1.39) Yo; = O21 yoo, 
and then by (1.39), (1.37), and (1.34a), u = yoo satisfies (1.24). 1 
We need one further minor simplification. 


(1.40) Theorem. 
The Cauchy problem (1.32) is equivalent to another problem of the same 
form in which ® = 0 and A,,..., An—1, B do not depend ont. 


Proof: To eliminate ® we simply set U(x,t) = Y(z,t) — (x). Then 
Y satisfies (1.32) if and only if U satisfies 


n-1 


aU = > Ai(x,t,U)0.,U + B(x,t,U), — U(x,0) = 0, 
1 


where 


Ai(x,t,U) = Ai(x, t, U +), 
n-1 


B(z,t,U) = B(x, t, U+%)+ > Ai(a, t, U + )02,0. 
1 


Next, to eliminate t from A; and B we merely add on extra component u° 
to U satisfying 0,u° = 1 and u°(z,0) = 0. Then u® = ¢, and we can replace 
t by u° in A; and B by adding the extra equation and initial condition. 1 
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In Theorems (1.31) and (1.40) there is no harm in assuming that all 
functions in question are real-valued, as a C¥-valued function can be re- 
garded as an R?_valued function. Since the constructions in these theo- 
rems clearly preserve analyticity, we have reduced the Cauchy-Kowalevski 
theorem to the following: 


(1.41) Theorem. 

Suppose that B is an analytic R'-valued function and A;,...,A,—1 are 
analytic N x N real matrix-valued functions defined on a neighborhood of 
the origin in R" x R%. Then there is a neighborhood of the origin in R" 
on which the Cauchy problem 


n-1 


(1.42) On = » Ai(z, Y)02,¥ + B(z,Y), 


Y(z,0) =0 


has a unique analytic solution. 


Proof: Let Y= (y1,- UN); B= (b,...,bny), Ai = (ai) tat: We 
seek solutions ym = )>c®/x2°%t? (m= 1,...,N) of (1.42). The Cauchy data 
tell us that c®° = 0 for all a, m. To determine c%) for j > 0, we substitute 
these power series into the differential equations 


(1.43) Ym = So aki(z,u, 6p YN)Oz, Yt + Om (2, y1,..-, YN): 
il 


By (1.27), substituting the series for the y;,’s into a‘, yields a power series 
in « and ¢ whose coefficients are polynomials with nonnegative coefficients 
in the ci and the coefficients of the Taylor series of at). Moreover, the 
coefficients of the terms in which ¢ occurs to the jth power only involve 
the c?’ with | <j. The same is true of the series obtained from b, and 
that obtained from 0;,y:, and multiplying a‘,, by O2,y: still preserves these 
properties. 

In short, on the right side of (1.43) we obtain an expression of the form 


be (ef i<j, coeff. of A; and B)x%t/, 


aj 


where P°% is a polynomial with nonnegative coefficients. On the left side, 
we have 
TG + ese 


aj 
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Hence, 
cH 43) — (5 4.1) PO (cf ic, coeff. of Aj and B), 


so if we know that cf with | < j we can determine the cf! with l=j+1. 
Proceeding inductively, we determine all the cB and more precisely we 
find that 


c#} = Q°d (coeff. of A; and B), 


where Q@/ is again a polynomial with nonnegative coefficients. 
Now suppose we have another Cauchy problem 


(¥ again being R%-valued) for which 

(a) we know that an analytic solution exists near (0,0); 

(b) the Taylor series of A; and B majorize those of A; and B. 

Then the solution Y = ({1,..-,¥n) of this problem is given by jm = 
> cai z*t?, where 


i = Q2) (coeff. of A; and B), 


QJ being the same polynomial as before. Since Q°) has nonnegative 
coefficients, 


jc02{ = |Q2/ (coeff. of A; and B)| < Q%) (coeff. of A; and B) = @). 


Hence the series for Y majorizes the series for Y, and the latter therefore 
converges in some neighborhood of (0,0). 

It is easy to construct such a majorizing system. Indeed, if M > 0 is 
sufficiently large and r > 0 is sufficiently small, by (1.30) the series for A; 
and B are all majorized by the series for 

PE ee ee: AR Oe Pe eee 
r—(ay+...+2n-1)—-(yi+.-.+ yn) 


Thus we consider the following Cauchy problem: for m= 1,...,N, 


Mr 
in SS TT ba mS 


Ym(x,0) = 0. 


(1.44) 
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A solution to this problem is readily found. In fact, if we solve the Cauchy 
problem 


Mr 
= ———_. - 1)d, 1], 
(1.45) uu mee Gar )0,u+ 1] 
; u(s,0) =0 
for one (scalar) unknown in two (scalar) variables s and t, and then set 


yj(z,t) = u(ait+...+ tp-1, t) (g=1,...,N), 


then Y = (y1,..., yn) will satisfy (1.44). 
But we have already seen how to solve (1.45) in §1B. Rewriting the 
differential equation as 


(r—-s—Nu)d,u— MrN(n-—1)0,u= Mr, 
we solve the ordinary differential equations 
= =r—s-Nu, i MN), —=- = Mr 
dr 
with initial conditions 
1(0) = 0, s(0) =o, u(0) = 0, 
and find that 
t= 4MrN(n—2)r? +(r—o)r, s=~-—MrN(n-1)r+e, us Mrr. 


Eliminating o and 7 yields 


wat) = 28 VE 


Mn 


(The minus sign on the square root is forced by the condition u(s,0) = 0.) 
Clearly this is analytic for s and ¢ near 0, so the proof is complete. 1 


We conclude with a few additional remarks concerning the Cauchy- 
Kowalevski theorem. First, the theorem asserts the existence of a unique 
analytic solution in a neighborhood of one point. However, given analytic 
Cauchy data on an analytic hypersurface S, there is a analytic solution 
near any point on S, and by uniqueness any two of these solutions must 
agree on their common domain. Hence we can patch them together and 


54 Chapter 1 


obtain a solution on a neighborhood of S. In general, the neighborhood 
will depend on the particular problem. (See Exercise 1.) 

Second, there remains the question of whether the Cauchy problem 
(1.24) might admit non-analytic solutions as well. In the linear case, the 
answer is negative: this is the Holmgren uniqueness theorem. The proof 
can be found in John [30], Hormander (26], (27, vol. I], or Treves [52]. 

A major drawback of the Cauchy-Kowalevski theorem is that it gives 
little control over the dependence of the solution on the Cauchy data. Con- 
sider the following example in R?, due to Hadamard: 


OFu+ O2u = 0, 


(1.46) 
u(z,,0)=0, Ogu(x1,0) = ke" sin kay, 

where & is a positive integer. This problem is non-characteristic since 

0? + 82 is elliptic, and one easily checks that the solution is 


u(z1,22) = e~V¥ (sin kz,)(sinh kr2). 


As k — co, the Cauchy data and their derivatives of all orders tend uni- 
formly to zero since e-VE decays faster than polynomially. But if z2 4 0, 
lime~Y* sinh kay = 00, so as k — oo the solution oscillates more and more 
rapidly with greater and greater amplitude, and in the end it blows up 
altogether. The solution for the limiting case k = 00 is of course u = 0. 
This example shows that the solution of the Cauchy problem may not de- 
pend continuously on the Cauchy data in most of the usual topologies on 
functions. 

The proof of the Cauchy-Kowalevski theorem obviously depends in an 
essential way on the assumption of analyticity. In the linear case, there is 
a more general version of the theorem that requires analyticity only in z, 
not in t; see Treves [52]. However, the analyticity hypothesis cannot be 
discarded completely, as we shal] see in the next section. 


EXERCISES 


1. Let S be the unit circle in the complex plane. A function ¢ on S is 
analytic if and only if it is the restriction to S of a holomorphic function 
on an annulus Ags = {z : a < [z| < b} where a < 1 < 5; in this case, 
if 7, Cme’™® is the Fourier series of ¢, 5°, ¢mz™ is the Laurent 
series of its holomorphic extension to A. 
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a. Use this fact to find an explicit solution of the Cauchy problem for 
the Laplacian on S: 


Au = 0 ona neighborhood of S, u=¢@ and 0,u=wonS. (+) 


Here z = re’®, so 0, is the normal derivative on S, and ¢ and ¥ are 
analytic on S. (Hint: z™ = r™e'™® and z-™ = r~™e'™? are both 
harmonic functions that agree with e'™® on S.) 

b. Show that if a < 1 < 6, there exist ¢ and y for which the solution 
to (*) exists on the annulus A,, but cannot be extended beyond 
Ag». (Hint: for any disc D there are holomorphic functions on D 
that cannot be extended holomorphically beyond D.) 


2. Carry out explicitly the reduction of the Cauchy problem for Laplace’s 
equation Au = f to a first order system as in Theorem (1.31), taking 
S to be the hyperplane zy, = 0. 


3. To see how the ideas in the proof of the Cauchy-Kowalevski theorem 
work in a simpler setting, prove the following theorem about ordi- 
nary differential equations in the complex domain. Suppose p(z) = 
So Pmz™ and q(z) = )-9 gm2™ are holomorphic in the disc |z| < R, 
and consider the initial value problem 


ul” = p(z)u’ + q(z)u, u(0)=co, u’(0) =c1. (4x) 
a. Show that if u(z) = )°>° cmz™ satisfies (x*), then 


1 


om+2 = Um + 2)(m + 1) 


iG + 1)¢j4.1Pm-j + Cj 9m—j] 
0 
so that the coefficients c,, are uniquely determined by (++). 

b. Suppose Pm > |[pm| and Qm > |¢ml for all m > 0, and let P(z) = 
Yo Pm2z™, Q(z) = To’ Qmz™. Show that if U(z) = 9’ Cnz™ 
satisfies U” = P(z)U’ + Q(z)U, and Cy > [eo| and Cy > |e,|, then 
Cm > |em| for all m. 

c. Suppose r < R. Show that the conditions of (b) are satisfied if we 
take Py = Kr-™ and Qm = K(m+1)r~™ for K sufficiently large, 
so that P(z) = (1—r7!z)~! and Q(z) = (1 —r71z)~?. Show also 
that the general solution of U” = P(z)U’ + Q(z)U in this case is 
a linear combination of (1 — r~!z)* and (1 — rz)? for suitable 
exponents a and f. 

d. Conclude that (+*) has a unique holomorphic solution u in the disc 
[z| < BR. 
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E. Local Solvability: The Lewy Example 


It was long believed that any “reasonable” partial differential equation 
(with no boundary conditions imposed) should have many solutions. In 
particular, suppose we have a linear equation halk a,0°%u = f where f 
and the coefficients ag are C™. Given zp € R", can we find a solution u 
(not necessarily C™) of this equation in some neighborhood of zo? If f 
and the aq’s are analytic and ag(xo) # 0 for some a@ with ja| = k, the 
Cauchy-Kowalevski theorem shows that the answer is yes. Indeed, we can 
choose a vector € which is non-characteristic for }>a,0% at zo (and hence 
at all x in a neighborhood of x9) and solve the Cauchy problem with zero 
Cauchy data on the hyperplane through zo orthogonal to €. 

One might well expect that the assumption of analyticity can be omit- 
ted. But in 1957 Hans Lewy [33] destroyed all hopes for such a theorem 
with the following embarassingly simple counterexample. Consider the dif- 
ferential operator Z defined on R3 with coordinates (z, y,t) by 
(1.47) L= Foti Bile tw) 5. 

(1.48) Theorem. 

Let f be a continuous real-valued function depending only on t. If there is 
aC! function u of (z,y,t) satisfying Lu = f in some neighborhood of the 
origin, then f is analytic at t = 0. 


Proof: Suppose Lu = f in the set where x? + y? < R? and |i} < R 
(R > 0). Set z = 2+iy; write z in polar coordinates as re and set 5 = r?. 
Consider the quantity V, a function of ¢ and r (or equivalently of ¢ and s) 
defined for 0 < r < R and |t| < R by the contour integral 


2a 
Va i} u(z,y,t)dz= ir f u(r cos 9, rsin@, t) e'? do. 
{zl=r 0 


By Green’s theorem, 


Ou 
vaiff E +i54| z,y,t)drd 
* Ines (82 (x, y,t)dzdy 


26 
=f i) E +i] (pcos @, psin @, t) pdpdé. 


Hence 


Or 


outs 
i de E +i5*] (zy, t) rf 


waif” [s+ | (r cos 8, rsin§, t)rdé 
0 Oz 
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The equation Lu = f then implies 


av 1 OV du du dz 
Gs ~ Or OF ~ Iter [s+ Se (uy 


i a dz Ow . 
= he. Budde + f f(t) a tart nif(t). 


Thus if we set F(t) = in f(r) dr, the quantity U(t,s) = V(t,s) + F(t) 


satisfies 
OU AU 


ot as ~ 
This is the Cauchy-Riemann equation, so U is a holomorphic function of 
w=t-+is in the region 0 < s < R?, |t| < R, and U is continuous up 
to the line s = 0. Moreover, V = 0 when s = 0, so U(0,t) = wF(t) is 
real-valued. Therefore, by the Schwarz reflection principle, the formula 
U(t, —s) = U(t, s) gives a holomorphic continuation of U to a full neigh- 
borhood of the origin. In particular, U(t, 0) = F(t) is analytic in t, hence 
so is f = F". i 


There is really nothing special about the origin in Theorem 1.48. In 
fact, a change-of-variable argument that we outline in Exercise 1 shows 
that for any (zo, yo, to) € R°, the equation 


Lu(z, y,t) = f(t + 2yox — 2zoy) 


has no C" solution in any neighborhood of (zo, yo, to) unless f(r) is analytic 
at r = to. 

Once this is known, it is not hard to show that there are C™ functions 
g on R3 such that the equation Zu = g has no solution u € C!+® (@ > 0) in 
any neighborhood of any point. The idea is as follows. Pick a C™ periodic 
function f on R that is not analytic at any point, and pick a countable dense 
set {(z;, y;,t;)}9° in R3. Then there is a sequence of positive constants {c;} 
tending to zero rapidly enough so that the series 


co 
ga(z,y,t) =) aye; f(t + 2yjx — 2a;y) 
j=l 


defines a C™ function on R* for any bounded sequence a = {a;}. One can 
then show that for “most” sequences a, in the sense of Baire category in 
the space [° of bounded sequences, the equation Lu = gq has no C!+@ 
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solution near any point. For the details of this argument, see Lewy [33] or 
John [30}. 

This construction also leads immediately to an example of a homoge- 
neous equation with no nontrivial solution. Namely, if f is a C@ function 
on R3 such that the equation Lu = f has no solution near any point, then 
the equation Lu— fv = 0 has no solution except v = 0. Indeed, suppose v is 
a solution that is nonzero on an open set 2. By shrinking 2 we can assume 
that a single-valued branch of the logarithm can be defined on v(2), and 
then u = log v is a solution of Lu = f on Q. 

A couple of years after Lewy proved Theorem (1.48), Hormander em- 
bedded it into a more general result that initiated the theory of local solv- 
ability of differential operators. We make a formal definition: A linear 
differential operator L with C™ coefficients is said to be locally solvable 
at xo if there is a neighborhood Q of zo such that for every f € CS°(Q) 
there exists u € D/(Q) with Lu = f. Hormander’s theorem is then as 
follows; see [26] for the proof. 


(1.49) Theorem. 
Let L be a linear differential operator with C™ coefficients on Q, let 
P(z,&) = xx(2,€) be the characteristic form of L, and let 
d & = 
OP oP oP oP 
Q(z,€) = x [Fees nO) - Fen (6) ge (e.6)| . 
a. If zo € 2 and there is a€ € R® such that P(xo,£) = 0 but Q(zo,€) #0, 
then L is not locally solvable at xo. 
b. If for each x € 2 there isa&é € R” such that P(x, £) = 0 but Q(z, €) £0, 


then there is an f € C™(Q) such that the equation Lu = f has no 
distribution solution on any open subset of 2. 


It is easy to check that the Lewy operator (1.47) satisfies the hypothesis 
of (b) on 2 = R*. Thus we obtain a strengthening of Theorem (1.48): there 
exist functions f € C™ for which the Lewy equation has no solution in D’, 
not merely in C?. 

Theorem (1.49) was the starting point for a considerable body of re- 
search into necessary and sufficient conditions for local solvability. For an 
account of this work, we refer the reader to the expository articles of Treves 
{50}, [51], and to Beals and Fefferman [6]. We mention also that Greiner, 
Kohn, and Stein have found a necessary and sufficient condition on f for 
the Lewy equation Lu = f to be locally solvable; see Stein [46, §XIII.4}. 
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EXERCISES 
1. Given (z9, yo, to) € IR3, define the transformation T of R° by 


T(z, y,t) = (z— £0, y— yo, t — to — 2yox + 2xoy). 


Show that if Z is the Lewy operator (1.47), then L(uoT) = (Lu) oT 
for any function u, and conclude that solving Lu = f(t + 2yor — 2zoy) 
near (Zo, yo, to) is equivalent to solving L(uo T) = f(t — to) near the 
origin. 

2. (Addendum to Exercise 1.) Show that the binary operation * on R® 
defined by 


(a,b,c) *(z,y,t) =(at+z,b+y,c+t+ 2br — 2ay) 


makes R®° into a group. (This group is known as the 3-dimensional 
Heisenberg group. Exercise 1 says that the Lewy operator is invariant 
under left translations on this group, as the transformation T is left 
translation by (—ro, —yo, —to).) 


3. The Lewy operator (1.47) arises in complex analysis because the equa- 
tion Lu = 0 is, in a sense, the restriction of the Cauchy-Riemann equa- 
tions on C? to the hypersurface {(z1,z2) : Imz2 = |z;|?}. More pre- 
cisely, define ® : R? — C? by 


O(z,y,t) = (z+ iy, t+ i(x? + y)). 


Suppose f is a holomorphic function on C?, so that it satisfies the 
Cauchy-Riemann equations (du; + i0,;)f = 0 for 7 = 1,2, where z; = 
u; +iv;. Show that L(fo) =0. 

4. Local non-solvability may occur for relatively trivial reasons when the 
characteristic form of an operator vanishes at a point. Show, for exam- 
ple, that the equation 20d, u—yd,u = z?+y? has no continuous solutions 
in any neighborhood of (0,0) in R?. (Hint: show that 20, — yO, = 04 
in polar coordinates.) 


F. Constant-Coefficient Operators: 
Fundamental Solutions 


A couple of years before Lewy discovered his example, Malgrange and 
Ehrenpreis independently proved that every linear differential operator 
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with constant coefficients has a fundamental solution (a concept we shall 
define below). An immediate corollary is that every constant-coefficient 
operator is locally solvable, and one can deduce regularity properties of the 
solutions by examination of the fundamental solution. In this section we 
shall derive these results, following an argument of Nirenberg [38]. 


Let 
L= Ss: CaO™ 
la|<e 
be a differential operator with constant coefficients. The natural tool for 
studying such operators is the Fourier transform; if f is any tempered 
distribution, we have 


(1.50) (Luy(€) = P(é)ucé), 


where 
P(é)= > Co(2mie)*. 
lalsk 

P is called the symbol of L; this notation relating L and P will be main- 
tained throughout this section. 

We begin by considering the question of local solvability of L. In view 
of (1.50), if f € Co°, it would seem that we should be able to solve Lu = f 
by taking t = f/P, that is, 


(1.51) u(z) = [ow Bae. 


The trouble with this is that usually the polynomial P will have zeros, so 
that f/P is not a locally integrable function and the integral (1.51) is not 
well-defined. However, since f € C'S, Ff extends to an entire holomorphic 
function on C” by Proposition (0.30). The idea will therefore be to make 
sense of (1.51) by deforming the contour of integration so as to avoid the 
zeros of P. 

To this end, we make a simplification. By a rotation of coordinates we 
can assume that the vector (0,...,0,1) is non-characteristic for L, which 
means that the coefficient of ¢* in P(€) is nonzero. After dividing every- 
thing through by it, we may — and shall — assume that this coefficient is 
1, so that 

P(€) = EF + terms of lower order in ny. 


For each fixed €/ = (£;,...,&:-1) € R®~!, we consider P(€) = P(€',én) as 
a polynomial in the single complex variable &,. Let A, (€'),..., Az (é’) be 
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its zeros, counted according to multiplicity and arranged so that for i < j, 
Im), (€') < Im); (€’) and Re A;(€') < Re Aj (€') if Im A;(€’) = Im A; (€'). By 
Rouché’s theorem, a small perturbation of €’ produces a small perturbation 
of the zeros of P(€’, &,), so the functions Im A; (€') are continuous functions 
of €’. Before proceeding to the main results, we need two lemmas. 


(1.52) Lemma. 
There is a measurable function ¢ : R"-! —+ [—k, k] such that for all €' € 
R"-!, 

min{|4(€’) — Im; (é)|: 1 <7 Sk} 21. 


Proof: The idea is simple: There are at most k distinct points among 
Im ;(€') (1 < j < &), so at least one of the & +1 intervals [2m — k — 1, 
2m—k+1) (0 < m < k) must contain none of them, and we can take 
¢(é') to be the midpoint of that interval. That is, for 0< m< k, let 


m = {é': Im; (E’) ¢ [2m —k — 1, 2m—k +1) for j =1,..., k}. 


Then the sets Vj, cover R"~!, and they are Borel sets since Im Aj is con- 
tinuous, so we can take 


m-~1 


(2) = 2m—k for E Vm \ L) Vi (0<m<k). | 
0 


(1.53) Lemma. 

Let g(z) be a monic polynomial of degree k in the complex variable z such 
that g(0) #0, and Jet A1,...,A, be its zeros, Then |g(0)| > (d/2)* where 
d= min|],;|. 


Proof: We have g(z) = (z — A1)---(z— Ax), so 


gz 


) 
(0) 


k 
=[[}1- —| <2" for |zl <a. 
1 Ak 
Moreover, g(*)(z) = k!, so by the Cauchy integral formula, 


kl 9(2) k!2*19(0)| 
ae EM? gel ec SON 
iri Aine |S ae 


kt = |g®(0)| = <= 


which is the desired result. i 


G2 Chapter 1 


(1.54) Theorem. 
If L is a differential operator with constant coefficients on R" and f € 
Co (IR"), there exists u€ C™(IR") such that Lu = f. 


Proof: We employ the notation introduced above. Let ¢ be as in 
Lemma (1.52), and set 


P(€) 


By Lemma (1.53) (applied to g(z) = P(E’, £n + z)) together with Lemma 
(1.52), we see that |P(£)| > 2-* when Imé, = 4(E’). Moreover, by Propo- 
sition (0.30), Fle) is rapidly decaying as | Reé| — oo when |Imé| remains 
bounded. Hence the integrand in (1.55) is bounded and rapidly decaying 
at infinity, so the integral is absolutely convergent. For the same reason, 
we can differentiate under the integral as often as we please and conclude 
that u is C™ and that 


Lu(z) = | | e?t=€ F(€) dep dé. 
nol Im a=¢(E') 


But now the integrand is an entire function which is rapidly decaying as 
|Reé| — 00, so by Cauchy’s theorem we can deform the contour of inte- 
gration in €, back to the real axis. By the Fourier inversion theorem, then, 
Lu= f. L] 


(1.55) u(ey= ff Legh erie HE) dé, dé’. 


The content of Theorem (1.54) can be usefully rephrased as follows. 
A fundamental solution for the constant-coefficient operator L is a dis- 
tribution K on R” such that Lk = 6, where 6 is the point mass at the 
origin. On the one hand, Theorem (1.54) is an immediate corollary of the 
existence of a fundamental solution, for if f € C2° we can take u= K * f: 
then Lu= LK + f =6« f = f. On the other hand, the proof of Theorem 
(1.54) easily yields a fundamental solution. 


(1.56) The Malgrange-Ehrenpreis Theorem. 
Every differential operator L with constant coefficients has a fundamental 


solution. 


Proof: With notation as above, define a linear functional K on C® 


by 
7 io, 
Ale Tete Pe)“ = 
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As in the proof of Theorem (1.54), the integral is bounded by 


C sup (L+IE) "FOILS C’ S> 18% flee, 
[Im €|<k 


Ja]gn+1 


where C and C’ depend only on the support of f, so K is a distribution. 
Moreover, (LK, f) = (K, L'f) where L’ is the operator with symbol P(—€), 
so that (L' f)(—-€) = P(é)f(—€). Hence, as in the proof of Theorem (1.54), 


en=f of Aow= | Oe = 10) = 69. 


Alternatively, one could observe that K + f is the function u defined by 
(1.55), so that LK « f = Lu = f for all f and hence LK = 6. | 


With a fundamental solution K in hand, we can solve the equation 
Lu = f not only when f € C& but when f is any distribution with compact 
support; of course, the solution u will then be a distribution. Indeed, if 
f € €', we have 


(1.57) L(Kef)=KeLf=f, 


since L(K * f) and K + Lf are both equal to LK + f =5* f = f. These 
relations can often be extended to f’s which are not compactly supported, 
but the class of f’s for which they hold will depend on the nature of K. 
We shall see some specific examples in Chapters 2, 4, and 5. 

Fundamental solutions are useful not only for producing solutions of 
differential equations but also for studying their regularity properties. In 
particular, we have the following important result. 

A differential operator L with C™ coefficients is called hypoelliptic 
if any distribution u on an open set 2 such that Lu is C® on 2 must 
itself be C™ on OQ, that is, if all solutions of the equation Lu = f must be 
C™ wherever f is C™. (The origin of this term is the fact that all elliptic 
operators are hypoelliptic, a fact which we shall prove in Chapter 6.) 


(1.58) Theorem. 

If L is a differential operator with constant coefficients, the following are 
equivalent: 

a. Some fundamental solution for L is C® on R" \ {0}. 

b. Every fundamental solution for L is C? on R” \ {0}. 

c. L is hypoelliptic. 
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Proof: If K is a fundamental solution for L, then DK = 6 is C™ on 
IR™ \ {0}, so (c) implies (b). (b) trivially implies (a), so it remains to show 
that (a) implies (c). For this we need a lemma. 


(1.59) Lemma. 
Suppose f and g are distributions on R", f is C@ on R” \ {0}, and g has 
compact support. Then f *g is C@ on R” \ (supp g). 


Proof: Given x ¢ suppg, choose ¢ > 0 small enough so that B,(z) 
and supp g are disjoint, and choose ¢ € C$°(B./2(0)) such that ¢ = 1 on 
B.;4(0). Then we can write 


feg=(¢f)*g+[— fly. 


On the one hand, (1 — ¢)f is a C™ function, so [(1 — ¢)f] *g is C™ 
everywhere. On the other hand, 


supp((¢f) * 9] C {z+ y: 2 € supp ¢, y € suppg}, 


which is disjoint from B,/2(z). Hence, on B,j2(z), f*g = [(1—¢)f] +g is 
c*. 1 


Returning to the proof of Theorem (1.58), let K be a fundamental 
solution for L that is C on R® \ {0}. Suppose u is a distribution on an 
open set 2 C R® such that Lu is C™ on 2. If c EQ, we pick € > 0 so that 
B.(2) C Q, and we shall show that u is C™ on B,j2(z). 

Pick @ € C2°(B.({z)) with ¢ = 1 on Beyo{z). Then L(du) = dLutyv 
where v = 0 on B,;2(z) and outside B,(x). K *(¢Lu) is C™ since dLu € 
Ce, and K *v is C® on B,j2(x) by Lemma (1.59). But by (1.57), 


gu = K+ L(¢u) = K*¢@Lut+K #v, 


so on B,j2(x), u = duis C™. i] 


EXERCISES 


1. Let L = )c$c;(d/dz)! be an ordinary differential operator with con- 

stant coefficients. Let v be the solution of Lv = 0 satisfying v(0) = 

- = vlk-2)(9) = 0, v®-D(0) = cg). Define K(x) = 0 if z < 0, 
K(z) = v(z) if z > 0. Show that K is a fundamental solution for L. 
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2. Show that the characteristic function of {(z,y):2 > 0, y > O} isa 
fundamental solution for 0,0, in R?. 


3. Show that K(z, y) = [27i(x + iy)]~! is a fundamental solution for the 
Cauchy-Riemann operator L = 0, + idy on R?. Hint: if ¢ € co, 


GR Oa AK 1) = — li as CAS LAP i 
4 e-0 c2+4y2>e? 2+ ty 
Use Green’s theorem to show that this equals 
1 dz +idy Lf : 
— ae dé. 
ie 201 Jnr4yIae2 9(2, 9) o+ty =a, Qa Pic poe tare?) 


4. Suppose LD is a constant-coefficient differential operator. Modify the 
proof of Theorem (1.58) to show that if there is a distribution K that 
is C® away from the origin and satisfies LK = 6+ f where f is C™, 
then L is hypoelliptic. 


Chapter 2 
THE LAPLACE OPERATOR 


For the next four chapters we depart from general theory to investigate 
the great trinity of operators from mathematical physics: the Laplace, 
heat, and wave operators. These operators are of fundamental importance 
not only because of their applications but because they are archetypes of 
more general phenomena; indeed, much of the theory of partial differential 
equations as it now stands has its roots in the study of these operators. 

We begin with what is perhaps the most important of all partial differ- 
ential operators, the Laplace operator or Laplacian A on R® defined 
by 


A=)la#=Vv-v. 
1 


It is useful to have a physical model in mind when thinking of A, and per- 
haps the simplest (among several) comes from the theory of electrostatics. 
According to Maxwell’s equations, an electrostatic field E in space (a vector 
field representing the electrostatic force on a unit positive charge) is related 
to the charge density in space, f, by the equation V- EB = f (provided the 
units of measurement are properly chosen: one usually finds 47 f in place 
of f) and also satisfies curl E = 0. (In n dimensions, curl EF is the matrix 
(0, E; — 4; E;).) The second condition means that, at least locally, EB is the 
gradient of a function —u, determined up to an additive constant, called 
the electrostatic potential. We therefore have Au = —V - E = —f, so the 
Laplacian relates the potential to the charge density. For more details, see, 
e.g., Kellogg [31]. 

Throughout this chapter we assume implicitly that we are working in 
IR” with n > 1. Much of the theory is valid also for n = 1 but becomes 
more or less trivial there. 
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A. Symmetry Properties of the Laplacian 


The Laplacian is not only important in its own right but also forms the 
spatial component of the heat operator 0;—A and the wave operator 07—A. 
Why is it so ubiquitous? The answer, which we shall now prove, is that 
it commutes with translations and rotations and generates the ring of all 
differential operators with this property. Hence, the Laplacian is likely to 
turn up in the description of any physical process whose underlying physics 
is homogeneous (independent of position) and isotropic (independent of 
direction). 

More precisely, to say that an operator LZ on functions on R™ commutes 
with translations (rotations) means that L(f oT) = (Lf)oT for any trans- 
lation (rotation) on IR". We also say that a function f on R” is radial if is 
rotation-invariant (f oT = f for all rotations T). Thus f is radial precisely 
when it is constant on every sphere about the origin, or equivalently when 
f(z) depends only on |z]. 


(2.1) Theorem. 

Suppose L is a partial differential operator on R". Then L commutes with 
translations and rotations if and only if L is a polynomial in A — that is, 
L = 3°a;A! for some constants a;. 


Proof: We leave it to the reader (Exercise 1) to verify that L com- 
mutes with translations if and only if D has constant coefficients, say 
L = cad". In this case, we have (Luy(€) = P(€)f(€) where P(é) = 
>. Ca(2ni€)*. Since the Fourier transform commutes with rotations (Propo- 
sition (0.21)), it follows easily that L commutes with rotations if and 
only if P is radial. In particular, this is true when L = J a;A/, ie., 
P(€) = Ya; (27i|€|)?7. On the other hand, since composition with linear 
maps preserves homogeneity, P is radial if and only if each homogeneous 
piece Pj(§) = Qo jajaj Ca(2i)* is radial. But this means that each P; 
depends only on |€|; since it is homogeneous of degree 7 we must have 
P;(€) = b;(27i|é|)? for some constant b;, and since P; is a polynomial we 
must have 6; = 0 when j is odd. In short, P(E) = >> ba;(27ilé|)*4, so 
L= > ba; Al, i 


Since the Laplacian commutes with rotations, it preserves the class of 
radial functions, on which it reduces to an ordinary differential operator 
called the radial part of the Laplacian. For future reference, we compute 
it explicitly. 
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(2.2) Proposition. 
If f(x) = $(r) where z € R® and r = |x|, then 


n—-— 


—*$(r), 


Af(z) =r) + 


Proof: Since dr/Oz; = 2;/r, we have 
= Zi oe zi ” 1, i ' 
afte) = 4 [24] = [#6 (r) + 24'(r) — Zor) 
= 8"(r) + 26(r) - 24'(r). i 


(2.3) Corollary. 

If f(x) = (r) is a radial function on R", then f satisfies Af = 0 on R"\{0} 
if and only if ¢(r) = a+ br?-" (n # 2) or (r) = a+blogr (n= 2), where 
a and 8 are constants. 


Proof: Af =0 means that ¢"(r)/¢'(r) = (1—n)/r. Integration gives 
log g(r) = (1 — n)logr + loge, or ¢/(r) = cr'~". One more integration 
yields the desired result. 1 


EXERCISES 


1. Show that a differential operator L on R® commutes with translations 
if and only if Z has constant coefficients. (Hint: consider the action of 
E on the monomials x%.) 


2. Show that the wave operator 0? — 02 on R? commutes with the Lorentz 
: : . = hé sinh @ 
transformations (hyperbolic rotations) Ty = (oe 4 a) (6 ER). 


B. Basic Properties of Harmonic Functions 


A C? function u on an open subset of R" is said to be harmonic if Au = 0. 
(We shall soon see, in Corollary (2.20), that the hypothesis u € C? can be 
relaxed without changing anything.) We proceed to derive some of the 
basic properties of harmonic functions. In what follows, we will need to 
integrate over various hypersurfaces S that are boundaries of domains 2 
in R"; do will denote the surface measure on S and v will denote the unit 
normal vector field on S pointing out of 2. 


The Laplace Operator 69 


(2.4) Green’s Identities. 
IfQ. is a bounded domain with smooth boundary S and u, v are C! functions 
on 92, then 


(2.5) i} vd,udo = i (vAu+ Vu- Vu) dz, 
S$ 2 


(2.6) [oa — ud,v)do = [oa —uAv) dz. 


Proof: (2.5) is just the divergence theorem (0 4) applied to the vector 
field vVu. (2.6) follows from (2.5) by interchanging u and v and then 
subtracting. 1 


(2.7) Corollary. 
If u is harmonic on Q then [, O,udo = 0. 


Proof: Take v= 1. | 


The following theorem states that the value of a harmonic function at 
a point is equal to its mean value on any sphere about that point. Here 
and in what follows, w, denotes the area of the unit sphere in R® (see 
Proposition (0.7)): 
Qnn/2 
“n= T(n/2) 


(2.8) The Mean Value Theorem. 
Suppose tu is harmonic on an open set 2. If x € Q and r > 0 Is small 
enough so that B,(z) C Q, then 


u(r) = sacs | u(y) do(y) = ae u(x + ry) do(y). 
Sr(z) (0) 


rly, 51(0 


Proof: We first remark that the second equality follows from the 
change of variable y — z+ ry, and that by composing with a transla- 
tion we may assume that z = Q. To prove the first equality, then, we 
use Green’s identity (2.6), where we take u to be our harmonic function, 
v(y) = [yl?-" if n £ 2 or v(y) = logly| if n = 2, and 2 = B,(0) \ B.(0) 
where 0 < € < r. By Corollary (2.3), v is harmonic in Q, and by (0.1), 
éyv is the constant (2—7)r!~" on S,(0) and the constant —(2—n)e!~" 
on S,(0). (The minus sign is there because the orientation of S,(0) is the 
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opposite of the usual one, and the factor (2 — n) should be omitted when 
n= 2.) Thus, by (2.6), 


0O= / (vd,u — ud,v) do — | (vd, u ~ ud,v) do 
S,(0) 5.(0) 


=," | O,udo+ any O,udao 
S,(0) S.(0) 


—(2—n)r'-" [ udo+(2— njcon f udo, 
S-(0) S(O) 


with suitable modifications when n = 2. By Corollary (2.7), the first two 
terms in the last sum vanish, so 


1 
ae ae oe ae 
r?"wn Js,(0) elu Js.(0) 


But u is continuous, so the right hand side, being the mean value of u on 
S.(0), converges to u(0) as e > 0. ' 


(2.9) Corollary. 
If u, Q, and r are as above, 


u(x) = 


n 
Ten 


n 
| u(y) dy = — u(z + ry) dy. 
B,(2) (0) 


wn Jp, 


Proof: Multiply both sides of the equation 


1 
u(z)= 2 [ule + pry) doy) 
Wn JS,(0) 


by p"~} dp and integrate from 0 to 1. 1 


(2.10) The Converse of the Mean Value Theorem. 


Suppose that u is continuous on an open set 2 and that whenever z € 2 
and B,(x) C 9 we have 


1 
u(x) = — u(x + ry) do(y). 
Wn JS1(0) 


Then u € C™(Q) and u is harmonic on Q. 
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Proof: Choose ¢ € C°°(B;(0)) such that f¢ = 1 and ¢(z) = ¥(|2}) 
for some p € C2°(R). Given € > 0, set d-(z) = e~"d(e“1z) and OQ, = {z: 
B{z) CQ}. Then if c € 2, the function y > ¢,(x — y) is supported in 
, and we have 


[unre -vydv= [we -viedy)dv= | 


u(x — y)d(e~!y)ew” dy 
B.(0) 


t 


1 
u(x —e = z-T r)r""! do r 
J (z — ey)6(y) dy 4 i u(x — rey)p(r)r?~" daly) d 


1 1 
wane) f (r)r"~1 dr = u(z) [ pis é(ry)r"—! do(y) dr 
u(z) [ oy) dy = u(z). 


In the first member of this string of equalities we can clearly differentiate 
under the integral as often as we please, since ¢, € C$°. Conclusion: 
u € C™(Q,), and since ¢ is arbitrary, u € C™(Q). Finally, if c € Q., 
the mean value of u on S,(z) is independent of r for r < €, so by the 
substitution z = ry and Green’s identity (2.5) (with v = 1), 


d 
i if u(z +ry)do(y) = I y- Vu(z + ry) do(y) 
r JS1(0) $3 (0) 


1 


= j (r-1z) - Vu(a + z) r!-" do(z) 
S-(0) 


sri” J Oude = af Au. 
S,-(0) B,(0) 


Thus the integral of Au over any ball in 2 vanishes, so Au = 0 in 2. i] 


(2.11) Corollary. 
If u is harmonic on Q then u € C™(Q). 


Proof: Apply Theorems (2.8) and (2.10) in succession. i 
(2.12) Corollary. 
If {uz} is a sequence of harmonic functions on Q which converges uniformly 


on compact subsets of Q to a limit u, then u is harmonic in 2. 


Proof: Since each u, satisfies the hypotheses of (2.10), so does u. § 
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(2.13) The Maximum Principle. 

Suppose Q is a connected open set. If u is harmonic and real-valued on 2 
and sup,eq u(z) = A < oo, then either u(z) < A forallz EQ oru(z)= A 
for alc E. 


Proof: Clearly {zr € 2: u(r) = A} is relatively closed in 2. But by 
the mean value theorem, if u(x) = A then u(y) = A for all y in a ball about 
z (otherwise the mean value on spheres about x would be less than A), so 
this set is also open. Hence it is either 2 or ©. i 


(2.14) Corollary. 
Suppose 2 is compact. If u is harmonic and real-valued on Q and continuous 
on ©, then the maximum value of u on 2. is achieved on 2. 


Proof: The maximum is achieved somewhere; if at an interior point, 
u is constant on the connected component containing that point, so the 
maximum is also achieved on the boundary. | 


(2.15) The Uniqueness Theorem. 
Suppose 2 is compact. If u, and uz are harmonic functions on Q which 
are continuous on 2 and u, = ug on 8, then uy, = ug on N. 


Proof: The real and imaginary parts of uj — ug and uz — wu; are 
harmonic on 2, hence must achieve their maxima on 82; these maxima 
are therefore zero, so u, = Ug. | 


The mean value theorem pertains only to harmonic functions, but the 
maximum principle and its corollaries are valid for solutions of much more 
general partial differential equations. See Protter and Weinberger [(40}, 
Miranda [37], and Exercises 3 and 4. 


(2.16) Liouville’s Theorem. 
If u is bounded and harmonic on R", then u is constant. 


Proof: For any zc € R" and R > |z|, by Corollary (2.9) we have 
tf way f wtuyay 
y) ay — y 
Rn |JBa(2) Br(0) 


|u(z)—u(0)| = 


< ——Ihulleo I dy, 


R°wa 
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where D is the symmetric difference of the balls Bra(z) and Br(0). D is 
contained in the set where R — [z| < |y] < R+|z], so 


R4{z| 


n n 
Jule) ~ w(0)] < Fer—lullo f dy = Frllulles [rt ar 
Rwn R-lel<licRte| R-II 
R+ {zl)" — (KR —- [z|)" 
which vanishes as R.— 00. Hence u(x) = u(0). ' 


EXERCISES 


1. Prove the maximum modulus principle for complex harmonic functions: 
If u is harmonic on Q and continuous on ©, the maximum value of |u| 
on 9 is achieved on 09. (Hint: If |u] achieves its maximum M at 29 
then u(zo) = e'9M; consider v = Re(e~‘#u),) 


2. Suppose u € C?(Q) and z € 2. Show that 


. anf li 
Au(s) = tim == [> [ ete tr tat) ~ u(e)]. 
This gives another proof of the converse of the mean value theorem for 
C? functions. (Hint: Consider the second-order Taylor polynomial of u 
about z. By symmetry considerations, Ss4¢0) rj; = Ss.c0) xyz, = 0 for 


jH#k and Js,(0) 2% =n} Ss,(0) Mt a} =n Is, (0) 1.) 


3. Here is another proof of Corollary (2.14) that works for more general 
operators. Let 2 be a bounded domain in R", and let 


L= >> aje(2)djOr + Db; (2) dj, 


where aj, and 6; are continuous functions on 2 and the matrix (a;z) 
is positive definite on 2. 

a. Show that if v € C?(Q) is real-valued and Lv > 0 in Q, then v 
cannot have a local maximum in 2. (Hint: Given to € Q, by a 
rotation of coordinates one can assume that the matrix (ajx(zo)) is 
diagonal [cf. the discussion of coordinate changes in §1A].) 

b. Show that if zo ¢ 2 and M > 0 is sufficiently large, then w(z) = 
exp[—M|z — zo|?] satisfies Lw > 0 in Q. 

c. Suppose u € C?(Q) M C(M) is real-valued and Lu = 0 in Q. Show 
that maxgu = maxgn u. (Hint: Show that this conclusion holds 
for v = u+ ew where w is as in (b) and € > 0.) 
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4. Let L be as in Exercise 3, and let Mu = Lu+c(z)u where c is continuous 

and nonpositive on 9. 

a. Assume u € C?(Q)NC(Q). By modifying the argument of Exercise 
3, show that if u > 0 and Mu = 0 in Q then maxgu = maxgn uw. 

b. Show that the uniqueness theorem (2.15) holds for M: if u,; and ug 
are solutions of Mu = 0 on 2 and wy, = ug on OC then uy = uz in 
Q. (Hint: Consider 2/ = {2 € 1: uy — uz > OF.) 

c. Show that this conclusion may fail if c > 0. (Make life simple: take 
L = (d/dz)? on R.) 


5. The only distributions whose support is {0} are the linear combinations 
of the point mass at 0 and its derivatives (Folland [14], Rudin [41]). 
Use this fact to prove a generalization of Liouville’s theorem: If u is 
harmonic on R” and |u(x)| < C(1 + |z|)% for some C, N > 0, then u 
is a polynomial. (Hint: The estimate on u implies that u is tempered 
and so has a Fourier transform.) 


6. Suppose u is a harmonic function on a disc D C IR?. Show that there 
is a harmonic function v on D, uniquely determined up to an additive 
constant, such that 0,v = —Qyu and dyv = Ozu. Show also that w = 
u+iv is holomorphic on D, i.e, satisfies the Cauchy-Riemann equation 
(d, + 10y)w = 0. (Hint: One way to define v is via line integrals of the 
differential form (0,u) dy — (Oyu) dz.) 


C. The Fundamental Solution 


In this section we compute a fundamental solution for the Laplacian and 
give some applications. 

One way to find a fundamental solution is by Fourier analysis. Since 
(Au) (é) = —42?|€[?%(€), on a formal level the inverse Fourier transform of 
(—4m?|€|?]—! should be a fundamental solution. When n > 2, this is exactly 
correct: |¢|~? is integrable near the origin by (0.5), so it defines a tempered 
distribution whose inverse Fourier transform is a fundamental solution. A 
similar result holds for n = 1 and n = 2 provided one “renormalizes” |€|~? 
so as to make it a tempered distribution. We shall show how this works, 
and more generally how to compute the Fourier transforms of distributions 
of the form |€|~°%, in §4B. 

However, a more elementary way to obtain a fundamental solution is 
as follows. Since the Laplacian commutes with rotations, it should have 
a radial fundamental solution, which must be a function of |z| that is 
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harmonic on R” \ {0}. By Corollary (2.3), such a function must be of 
the form a + b|z[?-" if n # 2 or a + blog|z| ifn = 2. (Note that these 
functions are locally integrable by (0.5), so they define distributions.) Since 
the constant function a is harmonic even at 0, it contributes nothing and 
can be omitted. It remains to show that the constant 5 can be chosen so 
as to obtain a fundamental solution, and here is the result. 


(2.17) Theorem. 
Let 
(2.18) N(x) = 


z|?-" 


(2 —n)wy 


Then N ts a fundamental solution for A. 


(n > 2); N(2) = = log|z|  (n = 2). 


Proof: The standard way to prove this result is via Green’s identities, 
and we invite the reader to perform this calculation (Exercise 1). Here we 
shall adopt a different method whose computations will be used again later. 
Namely, for € > 0 we consider a smoothed-out version NV‘ of N, 

({z|? + 2)(2—n)/2 
(2—n)wn 
log(leP +2) fy aay 

Cg 
N‘ — N pointwise as « — 0, and N‘ and WN are dominated by a fixed locally 
integrable function for « < 1 (namely, |N| when n > 2, or {log{z}|-+1 when 
n = 2), so by the dominated convergence theorem, NV‘ — N in the topology 
of distributions when « — 0. Hence we need to show that AN‘ — 6 as 
€—+ 0, ie., that (AN*, ¢) > (0) for any ¢ € CO. 

A simple calculation using (2.2) shows that 


AN‘(z) = nw 'e(|a|? + 7)“ 42? = e-Mb(em* 2), 


Nz) = (n > 2); 
(2.19) 


N“(2) = 


where 
Y(2) = AN'(2) = nwy (lel? + 1)-4/?, 
Also, AN*(—z) = AN*(z). Hence, by Theorem (0.13), 


(ANS 4) = / AN*(—2)¢(z) dz = $+ AN“(0) = a¢(0), 


where a = {(x)dz. But by integration in polar coordinates and the 
substitution s = r?/(r? 4 1), ds = 2rdr/(r? + 1), 


fo} n 1 
[ro dz = nf (r? 4 1)7(P42)/2 p91 Op = al s("~2)/2 ds = 1, 
0 


0 
and the proof is complete. i] 
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(2.20) Corollary. 
A is hypoelliptic: that is, if u is a distribution such that Au € C™(Q), 
then u € C™®(Q). In particular, every distribution solution of Au = 0 is a 
harmonic function. 


Proof: Apply Theorem (1.58). | 


Our name JN for the fundamental solution is in honor of Newton, since 
for n = 3 N is the Newtonian potential, i.e., the gravitational potential 
generated by a unit mass at the origin. In terms of electrostatics, N is 
the Coulomb potential, i.e., the electrostatic potential generated by a unit 
negative charge at the origin. 

We can now solve the inhomogeneous Laplace equation Au = f for 
any distribution f of compact support — namely, u= f * N. In fact, this 
formula also works for functions f without compact support provided they 
satisfy conditions to ensure convergence of the appropriate integrals. Here 
is a representative result along these lines: 


(2.21) Theorem. 
Suppose that f € L}(IR"), and that f |f(y)|log|y|dy < 00 in case n = 2. 
Then f*N is well-defined as a Jocally integrable function, and A(f+#N) = f. 


Proof: We assume n > 2 and leave the case n = 2 to the reader. Let 
Xr be the characteristic function of B,(0). Then by (0.5), x1N € L} and 
(l—xi)N € L™ (in fact, x: N € LP for p< n/(n— 2) and (l—yi)N € LP 
for p > n/(n—2)), so f *(x1N) € L! and f*[(1—x1)N] € L©. Moreover, 
xrf — f in DL) as r — 09, 80 (xf) * (xi N) — f *(x1N) in Lt and 
(xrf)*{(—xi)N] > f*[(1—xi)N] in L™. In particular, x, fand (x, f)*N 
converge respectively to f and f *N in the topology of distributions. Thus, 
since x, f has compact support, 


A(f « N] = lim A[(y, f) * NJ] =limy,f = f. i} 


Another interesting application of the fundamental solution N is the 
following representation of a harmonic function on a domain 2 in terms of 
its Cauchy data on OQ. For this purpose it is convenient to regard N asa 
function of two variables z, y € R® by the formula 


(2.22) N(z,y) = N(x — y). 
When we differentiate N(z, y), we shall indicate whether the differentiation 


is with respect to x or y by affixing the subscript z or y to the derivative; 
e.g., Oy N(z, y). 
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(2.23) Theorem. 
Let 0 be a bounded domain with C! boundary S. If u € C}(Q) is harmonic 
in Q, then 


(2.24) u(z)= f [u(y)d,, N(2, y) = dvu(y)N(z, y)] do(y) (z €2). 


Proof: Let N*(z,y) = N*(z — y), the N* on the right being defined 
by (2.19). Since Au = 0 in Q, by Green’s identity (2.6) we have 


| u(y)AyN“(2, y) dy = / [u(y)a, N(x, y) — &u(yN(e, y)] do(y). 
te) Ss 


As « — 0, the right side of this equation tends to the right side of (2.24) 
for each z € 2. (Since z # y for zr €E N andy ES, the singularities of N 
do not appear here.) On the other hand, the left side is just u+* (AN‘*)(z) 
if we set u = 0 outside 2, so the proof of Theorem (2.17), together with 
Theorem (0.13), shows that ux (AN‘)(z) > u(z) ase Oforrz EQ. 3 


The formula (2.24) suggests that we might try to solve the Cauchy 
problem 


(2.25) Au=0o0nQ, uz f and d,u=g on S, 


by the formula 
(2.26) —u(z) = I [f(v)O, N(@, y) — 9(v)N(2,y)] do(y). 


This won’t work in general, for we know by the uniqueness theorem (2.15) 
that the solution of (2.25) (if it exists) is completely determined by f alone. 
The function u defined by (2.26) will be harmonic in Q, since N(z, y) and 
Oy, N(z,y) are harmonic functions of zc € Q for y € S, but it will not 
have the right boundary values unless f and g are related by a certain 
pseudodifferential equation on S. (See §4B, where a special case is worked 
out.) 

As a consequence of Theorem (2.23), we obtain the analytic version of 
Corollary (2.20). 


(2.27) Theorem. 
If f is analytic on an open set 2 C IR" and u is a distribution on 2 such 
that Au = f, then u is analytic on Q. 
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Proof: We already know that u € C™(Q), and it suffices to show 
that for each zo € 2, wu is analytic in a neighborhood of zo. By the 
Cauchy-Kowalevski theorem we can find an analytic function u’ defined on 
some ball B,(zo) C Q and satisfying Au’ = f there. Let B = B,/2(z0), 
S = S,jo(20), and v= u—u'. Then v € C°(B) and Av = 0 on B, so by 
Theorem (2.23), 


sais [ [o(u)av, N(x, y) — dv(y)N(2,y)] do(y) (2 €B). 


But for y € S the functions N(x, y) and 0,, N(z, y) extend to holomorphic 
functions of the complex variable z in the region where | Rex — zo| < r/4 
and |Imz| < r/4. (in the definition (2.18) of N, simply interpret |z| 
as [)> 2?]!/?.) It follows that v(x) extends holomorphically to the same 
region, since one can pass complex derivatives under the integral sign. In 
particular, v is analytic on B,/4(zo), and hence so is u = v + wu’. ' 


After Corollary (2.20) and Theorem (2.27), it is natural to ask what 
one can say about the smoothness of solutions of Au = f when f has only 
a finite amount of differentiability. The simplest guess would be that if f € 
C*(Q) then u € C*+?(Q), but this turns out to be false (except in the one- 
dimensional case, where it is trivially true). However, analogous results are 
valid if one replaces C*() by slightly more sophisticated function spaces. 
One option is to replace continuous derivatives by L? derivatives, and we 
shall explore this in §6B. Another is to consider the Hélder spaces C#+*(Q). 
We have the following result, due to Holder himself: 


(2.28) Theorem. 
Suppose k > 0, 0 < a < 1, and Q is an open set in R”. If f € C*+*(Q) 
and u is a distribution solution of Au= f on Q, then u € C¥t*+2(Q), 


Proof: It suffices to establish the case k = 0, since A(O°u) = dF f. It 
then suffices to prove that if f € C%(Q) then u € C?+9(0’) for any open 1! 
with compact closure in 2. Given such an 9’, pick ¢ € C9°(M) such that 
¢@=10n, and let g = $f. Then A(g*N) = of = f on ', so u—(g + N) 
is harmonic and hence C™ on '. It is therefore enough to prove that if g 
is a C® function with compact support, then g * N € C?+*. This we now 
do. 

To this end, we consider the regularized kernel N‘ defined by (2.19) 
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and its derivatives 

Nj (2) = 0)N“(z) = wy t2;(le/? + ?)-°/?, 
ny xg (|2]? + <2) 42)/? (#3), 
wy (lel? +? — nz})(|z|?+e7)- 42/2 (i = 5). 


g* N* € C™ since NS € C™, and we have 0;(g + N‘) = g + Nf and 
9;0;(g * N*) = g* N§. We also need the pointwise limits of Nf and Nf as 
e— 0: 


Nis(2) = 103 N“(2) = { 


N;(z) = wy*2;[2|-", 
~nw7!2;2;|2|-"~? i#j), 
a (oer eae? a i) 

Let x, be the characteristic function of B,(0) as in the proof of Theorem 
(2.21). Then xi N® > x1N in L! and (1 — x1) Nf > (1—x1)N uniformly 
(uniformly on compact sets in case n = 2). Since g is bounded with compact 
support, it follows easily that g* N‘ — g* N uniformly (on compact sets), 
so g * N is continuous. Likewise, xiNf —> xi Nj in D} and (1 — x1)Nf 
(1 — x1)Nj; uniformly, so g * Nf — g* N; uniformly. This also shows 
that Nf — Nj; in the topology of distributions, so the locally integrable 
function N; is the distribution derivative 0;N, and hence 0;(g*N) = g* N; 
is continuous. 

This simple argument does not work for the second derivatives, because 
the functions N;; are not locally integrable at the origin; this follows from 
(0.5) since they are all homogeneous of degree -n. We must take more 
care to see what happens to g * N§ ase— 0. 

Let us consider first the case i # j. The functions Nf, and their limit 
Nj; are odd functions of x; (and z;), and it follows that their integrals over 
any annulus a < |r| < 6 vanish. For € > 0 we can even take a = 0, and we 
have, for any 6 > 0, 


nite) = f e—wnsnav—ale) NS (oddy 
= | foley) -a(2]NG(du+ foe — NG (vd. 
ly|<o ly|[>o 


Now we can let « — 0 to obtain 


lim g # NE (2) = i [92 —y) ~ 9(2)]Nij(u) dy + | o( — y)Nig(y) dy. 
ag lyl<e lyl>e 


This works because 


[9(z — y) — 9(2) NG (y)| < |[o(z - y) — 9(2)1Niz(y)| < Clyl?~”, 
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which is integrable on |y| < 6, so the dominated convergence theorem can 
be applied and the limiting integrals are absolutely convergent. In fact, the 
convergence is uniform in z since these estimates are, so that 0;0;(g * N) 
is continuous. Moreover, since } is arbitrary, we can let b > oo to obtain 


Aids(a+ NY(2) = fim [Lote v) ~ o(2)INis(v) dy. 
00 Jiyie 
A similar result holds for i= 7. We have 
NE = 1 -1 NE 
5 (2) = ave xr) + Nj,(z), 
where 7 is as in the proof of Theorem (2.17) and 
Nfj(z) = wy (lel? — 2})(lel? + €2) “470, 


Now, the integral J; of Ng,(y) over an annulus a < |ly| < 6, like that 


of Nf with i # j, vanishes. The reason is that J; is independent of j 
by symmetry in the coordinates, so nJj is the integral of )7? Nf; but 
ee N§,; = 0. Hence, the preceding argument, together with the proof of 
Theorem (2.17), shows that 0?(g * N) is continuous and that 


fla NY(z) = Za(2) + Jim [ [ol —v)— ale Nis (0) dy. 
a Oo Jiyl<b 


At this point we have shown that g * N € C?, and the proof will be 
completed by establishing the following general result and applying it to 
the kernels K = N;;. 


(2.29) Theorem. 

Let K be a C! function on R" \ {0} that is homogeneous of degree —n 
(K(rz) = r-" K(x) for r > 0) and satisfies Lovie K(y) dy = 0 for all 
a,b>0. Ifg is a C% function with compact support (0 < a < 1), then the 
function 


A(z) = lim | [g(2 -— z) — g(x) K(z) dz 
b—00 lzl<b 
belongs to C%. 


Proof: h is well-defined by the argument given above for the case 
K = Ni. Given y € R", we wish to estimate h(x + y) — h(r). Let us write 
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h = hy + ho where 


hy(z) = Fh cant — z) — g(x) K(z) dz, 


ho(z) = lim [9(z — z) — g(z)] K(z) dz. 
b0o Jalyl<lzl<o 
We have 
3iyl 
|hi(z)| < af Jz|@"" dz = cf rl dp = Caly|* 
1z1<3ly| 9 


for all z, and hence 
|hi(x +y) — Ai(z)| < [Ai (x + y)| + [hi (z)| < 2Calyl®. 
On the other hand, 


ho(e ty) = im J loletu~ 2) ~ of ]K(2) de 
yl<Jz 
= lim (g(a — z) — g(2)]K(z + y) dz, 


b—r00 Jaly|<jsty|<d 


s0 
ho(z + y) — Ao(z) 


(2.30) = im Ge — 2) — g(x)|[K(z + y) — K(z)] dz 


+ lim E;(b) + Eo, 
b-—+00 
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where £,(b) and E2 are errors coming from the disparity in the regions 
of integration. 2 (b) is the error coming from the symmetric difference 
between the regions |z| < b and |z + y| < 6, which is contained in the 
annulus b — fy] < |z| < 6+ |y|. Assuming 5 > Jy], in this annulus we have 
|z| ~ |z + y| © b, so |K(z)| < Cb-™ and |K(z+y)| < Cb-". Hence £,(b) 


is dominated by 


| llglloob~" dz = Cs] [glloob-" [(6 + ly)” — (6 — |yl)"), 
b—lyl<lz|<o+ly| 


which vanishes as b — oo. E> is the error coming from the symmetric 
difference of the regions {z] > 3]y| and [z+ y} > 3ly], which is contained in 
the annulus 2|y| < |z] < 4]y|. In this annulus we have |z| ~ [z+-y| & [y|, so 


|£2| < ca | lyl7"*% dz = Csly|*. 
2Iyl<lzl<4lyl 
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Finally, to estimate the main term in (2.30), we observe that for |z| > 3[y|, 


|K(z+y)—- K(z)[= 


1 
| y- VE(2 + ty) dt} < |y| sup |VK(z + ty)|. 
0 O<t<i 


Since K is homogeneous of degree —n, VK is homogeneous of degree —n—1, 
so 


IK(z+y)—K(2)] < Celyl sup | jzttyl-"-* < Crlyllzl-"-* (2 > 3iy). 
Hence the main term in (2.30) is bounded by 
Caf tetttallet-b de = Coly} f r2-? dr = Crolul®. 
l21>3{y] 3lyl 


(Note that the condition a < 1 is needed here.) Combining all these 
estimates, we have |h(z + y) — h(r)| < Cly|* as desired. 1 


Theorem (2.28) remains valid if A is replaced by an arbitrary elliptic 
operator L with smooth coefficients. (If L is of order m, the theorem is that 
if Lu € C'+@ then u € C#+™+.) The proof may be found in Stein [46, 
§ V1.5] or Taylor [48, §XI.2]. However, the essential ideas for this general 
result are all contained in the arguments above. 


EXERCISES 


1. Theorem (2.17) is equivalent to the assertion that (N,A¢) = ¢(0) for 
any ¢ € CS. Prove this by applying Green’s identity (2.6) with u = N, 
v = ¢, and Q = B,(0)\ B,(0), where r is large enough so that supp ¢ C 
B, (0). 

2. Show that the formula (2.18) for N in the case n > 2 also yields a 
fundamental solution for A = (d/dz)? in the case n = 1. (The proof of 
Theorem (2.17) works for n = 1, but a simpler argument is available.) 

3. Work out the proof of Theorem (2.21) for the case n = 2. 

4, Generalize Theorem (2.21) for n > 2 to include f in other L? spaces. 

5. Show that the following function is a fundamental solution for A? on 
R*: 

{z[4-" 
Wha NRT ea, A 1 4 } 
2(4 — n)(2 — n)wn cae 218) 
lz/? log {z| 


—log |z| amit, = 
4w,4 fs 4); 8x eee): 
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Can you generalize to find fundamental solutions for higher powers of 
A? 


6. Show that (47|z[{)~4e7¢l?! is a fundamental solution for —A +c? (c € C) 
on R3. 


7. Show that Theorem (2.23) remains valid if the hypothesis that u is 
harmonic is replaced by the hypothesis that u € C?(Q) and the term 
ta N(z, y)Au(y) dy is added to the right side of (2.24). Show also that 
this result remains valid if N is replaced by N —c, for any constant c. 


8. Suppose u is a C? function on an open set 2. Apply the result of 
Exercise 7, with 9 replaced by B,(x) and with c = r?-"/(2 —n)wn, to 
show that if B(x) CQ, 


wee f Wee oe), 
@)= fara aut 


| u(y) do(y). 
S,(z) 


Wyrrat 


(Here we assume n > 2; a similar formula holds for n = 2.) Combining 
this with Exercise 1 in §2B, conclude that if u € C?(Q) is real-valued, 
then u has the “sub-mean-value property” 


1 
u(x) < ee i u(y) do(y) whenever B, (x) C 2 


if and only if Au > 0 in Q. Functions with this sub-mean-value property 
are called subharmonic. 


D. The Dirichlet and Neumann Problems 


In this section we begin a study of boundary value problems for the Lapla- 
cian. The two most important problems, to which we shall devote most of 
our attention, are the so-called Dirichlet and Neumann problems. Through- 
out this discussion, 0 will be a domain in R® with smooth boundary S. 


The Dirichlet Problem: Given functions f on Q and g on S, find a 
function u on Q satisfying 


(2.31) Au= f on Q, u=gonS. 


The Neumann Problem: Given functions f on 2 and g on S, find 
a function u on 2 satisfying 


(2.32) Au= f on Q, O,u= gon S. 
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Of course, we should be more precise about the smoothness assumptions 
on f, g, and u, and if 2 is unbounded we shall want to impose conditions on 
their behavior at infinity. However, for the time being we shall work only 
on the formal ljevel and assume that Q is bounded. We shall not, however, 
assume that 2 is connected. (This added generality is only rarely useful, 
but it makes the theory in Chapter 3 turn out more neatly.) 

The uniqueness theorem (2.15) shows that the solution to the Dirichlet 
problem (if it exists) will be unique, at least if we require u € C(@). For the 
Neumann problem uniqueness does not hold: we can add to u any function 
that is constant on each connected component of 2. Moreover, there is an 
obvious necessary condition for solvability of the Neumann problem: if u 
satisfies (2.31) and 2 is a connected component of 2, by Green’s identity 
(2.5) (with v = 1) we have 


| f= Au= aus | g; 
! a an an! 


which imposes a restriction on f and g. 
The Dirichlet problem is easily reduced to the cases where either f = 0 
or g = 0. Indeed, if we can find functions v and w satisfying 


(2.33) Av =f onQ, v=0onS, 
(2.34) Aw =0o0nQ, w=gonS, 


then u = v+w will satisfy (2.31). Moreover, the problems (2.33) and (2.34) 
are more or less equivalent. Indeed, suppose we can solve (2.33) and wish 
to solve (2.34). Assume that g has an extension g to 2 which is C?; then 
we can find v satisfying 


Av = AgonQ, v=0onS, 


and take u = g — v. On the other hand, suppose we can solve (2.34) and 
wish to solve (2.33). Extend f to be zero outside 2 and set v' = f * N, so 
that Av’ = f. We then solve 


Aw=00nQ, w=v' on S, 


and take v = v' — w. Henceforth when we consider the Dirichlet problem 
we Shall usually assume either that f = 0 or that g = 0. 

Similar remarks apply to the Neumann problem: it splits into the cases 
f =0 and g = 0, and these are roughly equivalent. To derive the analogue 
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of (2.34) from that of (2.33), we assume that there exists 9 € C?(Q) such 
that 0,9 = g on S and solve 


Av = Ag on Q, d,v=00n S, 
To go the other way, we set vu’ = f * N and solve 
Aw =0 on Q, d,w = 0,v' on S. 


There are many approaches to the Dirichlet and Neumann problems, 
and we shall investigate several of them. This is instructive because the 
various methods yield somewhat different results, and also because the 
techniques involved are applicable to other problems. In fact, we shall 
solve the Dirichlet problem by Dirichlet’s principle (§2F), layer potentials 
(Chapter 3), and L? estimates (Chapter 7), and the last two methods will 
also solve the Neumann problem. In addition, we shall obtain explicit 
solutions on a half-space (§2G) and a ball (§2I1). At this point, we sketch 
yet another approach — stil] on the formal level — using the notion of 
Green’s function. 


E. The Green’s Function 


The Green’s function* for the bounded domain Q with smooth boundary 

S is the function G(x, y) on 2 x @ determined by the following properties: 
i. G(x, -) — N(z,-) is harmonic on Q and continuous on %, where N is 

defined by (2.22) and (2.18), and 

ii. G(z,y) = 0 foreach rE QandyeE S. 

Clearly G is unique: for each x € 2, G(z,-) — N(z,-) is the unique solution 

of the Dirichlet problem (2.34) with g(y) = —N (x,y). Thus if we can solve 

the Dirichlet problem, obtaining a continuous solution from continuous 

boundary data, we can find the Green’s function. 

(Green himself gave a simple physical “proof” of the existence of G. 
Let S be a perfectly conducting shell enclosing a vacuum in 2, and let S 
be grounded so the potential on S is zero. Let a unit negative charge be 
placed at s €. This will induce a distribution of positive charge on S to 
keep the potential zero, and G(z,y) is the potential at y induced by the 


* The ubiquitous use of “Green's function” rather than the more grammatical “Green 


function” is an example of what Fowler [19] called “cast-iron idiom.” 
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charges at z and on S. Unfortunately, to impart mathematical substance 
to this argument is a decidedly nontrivial task.) 

On the other hand, if we can find the Green’s function, we obtain simple 
formulas for the solution of the Dirichlet problem. To see how this works, 
we shall have to make some assertions which we are not yet able to prove. 


(2.35) Claim. 
Let Q be a bounded domain with C®@ boundary S. The Green’s function 
G for Q exists, and for each c EQ, G is C@ on D\ {x}. 


Granting this claim, we have: 


(2.36) Lemma. 
G(z,y) = G(y,z) for allz,y EQ. 


Proof: Given z and y, set u(z) = G(z,z) and v(z) = G(y,z). Then 
Au(z) = 6(a — z) and Av(z) = 6(y — z) where 6 is the Dirac distribution, 
so a formal application of Green’s identity (2.6) yields 


G(2,¥) — Gly,2) = i: (G(x, 2)6(y — 2) — Gly, 2)6(« — 2)] de 
= [ice z)0,,G(y, z) — Gly, z)0,,G(z, z)] do(z) = 0, 


since G(x, z) = G(y, z) = 0 for z € S. This argument may be made rigorous 
by replacing G by G-N +N and letting € — 0 as in the proof of Theorem 
(2.23), or alternatively by excising small balls about 2 and y from Q and 
letting their radii shrink to zero as in the proof of the mean value theorem. 
Details are left to the reader. | 


Because of this symmetry, G may be extended naturally to 2 x 2 by 
setting G(z,y) = 0 fore € S. Also, G(., y)— N(., y) is a harmonic function 
on 2 for each y. 

Now, to solve the inhomogeneous equation with homogeneous boundary 
conditions (2.33), we set f = 0 outside 2 and define 


ve) = i G(z,y)f(y) dy = f*N(2) + | [G(z, v) — N(x, y)}F(y) dy. 
n 2 


The Laplacian of the first term on the right is f, and the second term is 
harmonic in a. Also, v(x) = 0 for x € S since the same is true of G(z,-). 
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Next, consider the homogeneous equation with inhomogeneous bound- 
ary conditions (2.34). We assume that g is continuous on S, and we wish 
to find a solution w which is continuous on 9. We can reason as follows: 
suppose the solution w is known, and suppose that w € C!(Q). Applying 
Green’s identity (2.6) (together with some limiting process as in the proof 
of Lemma (2.36)), we obtain 


w(2) = i w(y)8(z)9) aye i [w(y)AyG(z, y) — Aw(y)G(z, »)] dy 


= [ w(a.,62,v) daly) 


for x € Q, since G(z, y) = 0 for y € S. This formula represents w on 2 in 
terms of its boundary values on S. 
Therefore, the obvious candidate for the solution of (2.34) is 


(2.37) w(2) = f o(v)8.,G(2, 9) do(y). 


Since 6,,G(zx,y) is harmonic in x and continuous in y for € Qand y€ S, 
it is clear that w is harmonic in 2. 


(2.38) Claim. 
If 9 € C(S) and w is defined by (2.37) on Q, then w extends continuously 
to1 and w=gonS. 


The function 6,,G(z,y) on Q x S is calleed the Poisson kernel for 2, 
and (2.37) is called the Poisson integral formula for the solution of the 
Dirichlet problem. 

As mentioned above, we shall force the Dirichlet problem into submis- 
sion by other methods, and afterwards, in §7H, we shall return to this 
discussion and prove Claims (2.35) and (2.38). (We shall also verify them 
directly for the unit ball in §211.) 


EXERCISES 
1. Complete the proof of Lemma (2.36). 


2. Show that the Green’s function for (d/dz)? on (0,1) is G(z,y) = 
z(y — 1) for 2 < y, G(z,y) = y(a — 1) forz > y. 
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F. Dirichlet’s Principle 


Given a bounded domain 2 with smooth boundary S, we define the Her- 
mitian form D on C1(Q) by 


D(u,v) = | Vu- Vo. 
2 


D(u,u) = f,]Vul? is the so-called Dirichlet integral of u; physically it 
represents the potential energy in © of the electrostatic field —Vu. 

We note that u — D(u, u)'/? is a seminorm on C1(Q), and D(u, u) = 0 
if and only if u is constant on each connected component of 2. Let H(Q) 
be the completion of C'(Q) with respect to the norm 


{luligy = [owas five) 


H,() can be regarded as a subspace of L?(Q), consisting of functions 
u € L?(Q) whose distribution derivatives 0;u are also in L?(Q), and it is 
a Hilbert space with inner product (u|v)(i) = D(u,v) + f, ud. We shall 
study it in more detail in §6E. 


(2.39) Proposition. 
There is a constant C > 0 such that f, |u|? < CllullZ,) for all ue 019). 


Proof: Extend the normal vector field vy on S in some smooth fashion 
to be a vector field on 9. (For example, extend it to a neighborhood of 
S by making it constant on each normal line to S, then multiply it by a 
smooth cutoff function.) By the divergence theorem (0.4), 


ul? = ul? * = ; uly; 
wi [(uPo)-» S [ates 
=p3 iL [[u(apaz)vy + [(Oju)eeng + ful?10,04 1] - 
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Thus, letting C’ = supp S77 (|¥j| + 10;4;1), 


fiw? sory) f Quayal+ ta3;ul + Iu?) 
Ss 7 Ja 
n 1/2 1/2 
( ter] [freee] em ft 
<c' (an [ +90 f iasur 
Ss ( nt etal . a ju] 


= 2nC! 7 lul? + C’D(u, uw), 


where we have used the Schwarz inequality and the fact that 2ab < a? + b? 
for all positive numbers a,b. Thus we can take C = 2nC". | 


(2.40) Corollary. 
The restriction map u — u|S from C!(Q) to C1(S) extends continuously 
to a map from H,(Q) to L?(S). 


It follows that elements of H,(Q) have boundary values on S which 
are well-defined as elements of L?(S); we denote the boundary values of 
u € H,(Q) by u|S. However, not every L? function on S — indeed, not 
every continuous function on S — is the restriction to S of an element 
of H,(Q). Roughly speaking, the restriction of a function in Hi(Q) must 
possess “L? derivatives of order ” on S. See Exercise 3 and Theorem 
(6.47).) 

Let H9(Q) be the closure of C2(Q) in Hi(Q). Clearly, if f € H$() 
then f|S = 0. (The converse is also true. We shall not prove this, but see 
Proposition (6.50) and the remarks preceding it.) 

We propose to solve the following version of the Dirichlet problem 
(2.34). We assume that the boundary function g is the restriction to S 
of some f € Hi(Q), and we take the statement “w = g on S” to mean that 
w-— f € HP(Q). Thus, given f € Hi(Q), the problem is to find a harmonic 
function w € H(Q) such that w— f € H}(Q). 


(2.41) Theorem. 
Suppose w € Hi(Q). Then w is harmonic in Q if and only if w is orthogonal 
to H°(Q) with respect to D, that is, D(w,v) = 0 for all v € HP(Q). 


Proof: By Green’s identity, if w € C1(M) and v € C2°() then 
Ja wAv = —D(w,v), there being no boundary term since v vanishes near 
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the boundary. Passing to limits, we see that this identity remains true for 
any w € H,(Q). Hence, w is harmonic in 2 <=> w satisfies Aw = 0 in 
Q in the sense of distributions (Corollary (2.20)) <=> f wAv = 0 for all 
vEecr(Q) <> D(w,v) = 0 for alu EC?(Q) <> D(w,v) = 0 for all 
v € HP(Q). 1 


Since the functions u € H;(Q) with D(u,u) = 0 are locally constant, 
hence harmonic, and no such function except 0 belongs to H9(Q), it follows 
from elementary Hilbert space theory that each f € H,(Q) can be written 
uniquely as f = w + where w is harmonic and v € H9(Q). (The Hilbert 
space in question is H,(Q) modulo locally constant functions, with inner 
product D.) Thus w, the orthogonal projection of f onto the harmonic 
space, is the solution of the Dirichlet problem posed above. 

From the norm-minimizing properties of orthogonal projections in a 
Hilbert space, it also follows that solving this Dirichlet problem is equiva- 
lent to minimizing the Dirichlet integral in a certain class of functions. This 
approach to solving the Dirichlet problem via the calculus of variations is 
the classical Dirichlet principle: 


Dirichlet’s Principle. 

If f and w are in H,(Q), the following three conditions are equivalent: 

a. w is harmonic in Q and w— f € H$(Q). 

b. D(w,w) < D(u,u) for all u € Hy(Q) such that u—f € H2(Q). 

c. D(Ww—f, w—f) < D(u,u) for all u € Hi(Q) such that u—f is harmonic 
in Q. 


The reader who is acquainted with the checkered history of Dirichlet’s 
principle — it was stated by Dirichlet, used by Riemann, discredited by 
Weierstrass, and rehabilitated much later by Hilbert — may be surprised 
at the simplicity of the above arguments. Several points should be kept 
in mind. In the first place, 19th-century mathematicians did not have 
Hilbert spaces, or the theory of Lebesgue integration with which to con- 
struct Hilbert spaces of functions, at their disposal. In an incomplete inner 
product space like C1(9) there is no guarantee that orthogonal projec- 
tions will exist. Neither did they have the notion of distribution, much 
less a proof that distribution solutions of Au = 0 are genuinely harmonic, 
a fact which was essential for the proof of Theorem (2.41). On the other 
hand, we have solved the Dirichlet problem in a weaker sense than the old 
mathematicians would have wished: we had to assume that the boundary 
function is the restriction of a function in H;(Q), and we only showed that 
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the solution assumes its boundary values in the sense of Corollary (2.37). 
The first restriction is unavoidable in the context of Dirichlet’s principle, 
but we would like to know, for example, that if the boundary data are 
continuous on S then the solution is continuous on @. This can be proved 
in the setting of Dirichlet’s principle (see John [30]), but we shall derive it 
by different methods in Chapter 3. 


EXERCISES 


In the following exercises, Q is the unit disc in R? and S is the unit circle. 
For m > 0 we set em(z, y) = (x + iy)™, and for m < 0 we set em(z,y) = 


(x — iy), 


1. Show that {e,,}°,, is an orthogonal set with respect to the inner prod- 
uct on L?(Q) and also with respect to the Dirichlet form D on Q, and 
hence with respect to the inner product (-|-)(1) on Hi(Q). Compute 
\lem||c1) for all m, and conclude that a series > cmém converges in 
Hy, (Q) if and only if 57 |m| |em|? < 00. (Hint: polar coordinates.) 


2. Show that {e,,}%, is an orthogonal basis for the space of harmonic 
functions in H,(Q). (It is trivial to verify that each e,, is harmonic. 
To see that they span all harmonic functions, you can use Exercise 6 in 
§2B together with the fact that every holomorphic function in H,(Q) 
has an expansion 5-5’ ¢mém [a Taylor series!]). 


3. If f = So cmem € Hi(Q), the restriction f|S is the function in L?(S) 
whose Fourier series is )> cme’™®. Conclude that a function >> cme"? 
is the restriction of a function in H;(Q) (in fact, of a harmonic function 
in 4(Q)) if and only if 57 [m| |em|? < 00. Find a sequence {cm} such 
that 5 |em| < co but > |m| |cm|? = oo, and hence exhibit a continuous 
function on S that is not the restriction of a function in Hy(Q). 


G. The Dirichlet Problem in a Half-Space 


In this section we shall solve the Dirichlet problem in the half-space {x € 
R" : z, > 0} by computing the Green’s function and the Poisson kernel 
explicitly. Actually, we shall change notation slightly: we replace n by n+1 
and denote z,4; by t. The domain in question is then 


Rit) = {(2,t)€ R”x R:t > 0}, 
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and the Laplacian on it is 


Al+ =i 


Finding the Green’s function for Rit is just a matter of making the 
following simple observation, In physical terms, if a charge is placed at 
(z,t) and an equal but opposite charge is placed at (z,~t), the induced 
potential at the points equidistant from these two charges — namely, the 
points (x, 0) — will always be zero. Thus, the Green’s function for Rt) is 


G((z,t), (y,s)) = N(z-—y,t -s)- N(z—-y, -t~s), 


where N is the fundamental solution given by (2.18) with n replaced by 
n+ 1. This clearly enjoys the defining properties for a Green’s function: it 
satisfies Ary, s)G((z,t), (y, s)) = 6(x ~y, t—s) fort,s > 0, and it vanishes 
whent =QOors=0. 

From this we immediately have the solution of the Dirichlet problem 


A,u+ Oeu= f on RO u(x,0) = 0, 


namely, 


u(x, je. ie G((z,t), (y, s)) f(y, s) dy ds. 


More precisely, if f is, say, bounded with compact support in Ret (these 


conditions can be relaxed), u will be continuous on R{+! with u(z,0) = 0, 
and u will satisfy (A, + 0?)u = f (in the sense of distributions) on R{t?. 
To solve the dual Dirichlet problem, 


(2.42) A,u+ d?u=00n RZ*, u(z,0) = g(x), 


we compute the Poisson kernel. Since the outward normal derivative on 
aRyt? is —0/0t, the Poisson kernel is 


6) re) 
— 5, (2,2), (y, 8)) |< = —3, [Ne t-—s)—N(z—y, -t~- 5)] ,<0 
t 
~ wing (le — yp? + eH 


According to (2.37), the candidate for a solution to (2.42) is then 


2t 
Mt fala a yO 
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In other words, if we set 


2t 


(2.43) Pi(z)= wnsile[? + teFh/e 


(x ER", t>0), 


which is what is usually called the Poisson kernel for Ri, the proposed 
solution is 
u(z,t)=g*P,(x) (convolution on R*). 


We now verify that this works. The key observations are that 
P,(z) = t7" Py (t7! 2) 


and that 


i 


Qwn f%  rP-ldr 
[ Piayaz a f ( gee 
CCD geet 
~ “PP Gn) (r? + 1) F072? 


which by the substitution s = r?/(r? +1) (so r? = s/(1— s)) equals 


CEH TE ses Sire. 
Tare ee 


by the well-known formula for the beta integral. (See Folland [17, Appx. 3] 
or Hochstadt [25, §3.4].) 


(2.44) Theorem. 

Suppose g € L?(IR"), where 1 < p < co. Then u(z,t) = g * P,(x) is well- 
defined on Ry and is harmonic there. If g is bounded and continuous, 
then u is continuous on R{t? and u(x, 0) = g(x). If g € LP where p < co, 
then u(.,t) —> g in the L? norm ast — 0. 


Proof: We note that R € L'NL®, so Rh € L4 for all g € [1, co]; 
hence the integral g + P,(x) is absolutely convergent for all z and t, and 
the same is true if P; is replaced by its derivatives A,P; or 07 P:. Since 
G((z,t), (y, s)) is harmonic as a function of (z,t) for (z,t) # (y, 5), Pi(z) 
is harmonic on i ead and hence so is u: 


A,ut fu=g*(Ast+O)P =0. 


The remaining assertions follow from the calculations preceding the theo- 
rem together with Theorem (0.13). i] 
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The solution to the problem (2.42) is not unique; for example, if u(z, t) 
is a solution then so is u(z,t)+ ct for any c € C. However, if g is bounded 
and continuous on R”, then u(z,t) = g * P(x) is the unique bounded 
solution; see Exercise 1 in §2I. Here we shall prove the analogous result 
with boundedness replaced by vanishing at infinity. 


(2.45) Theorem. 

If g is continuous and vanishes at infinity on R", then u(z,t) = g * P;(z) 
vanishes at infinity on Ro, and it is the unique solution of (2.42) with 
this property. 


Proof: Assume for the moment that g has compact support, say 
g(z) = 0 for |x| > a. Then g € L}, and ||g * Pilloo < IIgllil] Filles < Ct", 
so u(x,t) —+ 0 as t —+ oo uniformly in z. On the other hand, if0 <t < R, 


ju(z, t)| < Ilglls ies [P(e — yl < CRix|-"-* 
vi<a 


for |z| > 2a, so u(x,t) + 0 as x + 00 uniformly for t € (0, R]. This proves 
that u vanishes at infinity when g has compact support. For general g, 
choose a sequence {gn} of compactly supported functions that converge 
uniformly to g, and let un(z,t) = gn * P(x). Then up vanishes at infinity, 
and ut, — u uniformly on Rt? since 


{| — tlloo = sup Ign —9)* Prlloo < sup llgn ~gllcol|Pell: = Ilan — glloo > 0. 


Hence u vanishes at infinity. 

Now suppose v is another solution, and let w = v—u. Then w vanishes 
at infinity and also on the hyperplane t = 0. Thus, given ¢€ > 0, if R is 
sufficiently large we have |w| < € on the boundary of the region |z| < R, 
0<t< R. By the maximum principle (cf. Exercise 1 in §2B) it follows 
that |w| < € on this region. Letting « — 0 and R — oo, we conclude that 
we=0. | 


If t,s > 0, the function u(x,t) = P,4:(x) vanishes at infinity on Rw 
and satisfies (2.42) with g(r) = P,(x), so it follows from Theorem (2.45) 
that 

Prat = P, * Py. 


That is, the functions P, form a semigroup under convolution, and the 
corresponding operators g — g * P, form a semigroup under composition, 
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called the Poisson semigroup. This is a contraction semigroup on L? for 
1 < p < o0, since 
lg * Pellp < UgllplPella = Ilgllp, 


and it is strongly continuous on L? for p < oo by Theorem (2.44). 

Some of the results of this section can also be obtained by using the 
Fourier transform on R". When n = 1 this works out rather simply; see 
Exercise 1. We shal] consider the case n > 1 in §4B. 


EXERCISES 


1. Assume that n = 1. Show that P(é) = e~2*tll either directly or via 
the Fourier inversion theorem. Use this result to give simple proofs that 
u(x,t) = g + P,(x) satisfies (2.42) if g,g € L', and that P, * Py = Prt. 


2. The formula (2.43) for P; makes sense for all ¢ € R, not just ¢ > 0. 
Show that if f € L'(IR") and u(z,t) = f * P,(z) for all (x,t) € R"*, 
then A,u+t O?u = 2f(2)é'(t). 


H. The Dirichlet Problem in a Ball 


We now solve the Dirichlet problem for the unit ball in R", first by com- 
puting the Green’s function and Poisson kernel explicitly, and then by 
expansion in spherical harmonics. We remark that these results are eas- 
ily extended to arbitrary balls by translating and dilating the coordinates. 
Throughout this section, we employ the notation 


B= B,(0), S=d0B= S1(0). 


The Green’s function for B may be found by an idea similar to the one 
we used for the half-space in §2G. Namely, the potential generated by a 
unit charge at z € B can be cancelled on S by placing a charge of opposite 
sign at the point 2/|z|? obtained by “reflecting” 2 in S. As it turns out, 
the magnitude of the second charge should be not 1 but |z|?—”, and a slight 
modification must be made when n = 2. (We shall obtain more insight into 
this when we consider the Kelvin transform in §21.) To see that this works, 
we use the following lemma. 


(2.46) Lemma. 
Ifz,yé€R", c £0, and |y| =1, then 


jx — y| =| {x[~*z — |zly|. 
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Proof: We have 
2 es 1.42 
|z — yl’ = |z/?-22-y+1= \izly| — 2(\a] 1z)- (lzly) + | |x| 12 | 
2 2 

= |Iz| 1p — |z|y| ‘ i] 

Now, assuming n > 2, let us define 
G(z,y) = N(z—y) — |z)?-"N(\z|~72 — y) 
1 2=-n =1 2—n 

Grays [l@-uP = [le- te lel PY 
From the first equation it is clear that G(x, y) — N(z,y) is harmonic in 
y for y # |x|~2z, and in particular for y € B when z € B. The second 


equation, together with Lemma (2.46), shows that G(z,y) = 0 for y € S. 
It also makes clear how to define G at x = 0: 


G(0,y) = {lyP?-" - 1]. 


1 
(2—n)wy 
When n = 2, the analogous formula is 


Gle,v) = 5 [log |e - yl—log|[e\te-lely|] (@ #0), 


G(0,y) = = log lyl. 

Again it is clear that G enjoys the defining properties of a Green’s function. 

Clearly G satisfies Claim (2.35). The symmetry property G(z,y) = 
G(y, x) is not obvious from the formula for G, but it is not hard to verify 
directly by a calculation like the proof of Lemma (2.46). 

Now that we know the Green’s function, we can compute the Poisson 
kernel 

P(x, y) = 0,,G(x, y) (2éB, yeéS). 

Indeed, by (0.1) we have 0,, = y- Vy on S, so for all n > 2, 


eye rt {yay _ lely- (el — lely) 
P( we |z —y|" ||z|-22 — [zy |" 


Since |y| = 1, Lemma (2.46) then implies that 
1 |z? 
wale — yl” 

It is a fairly simple matter to prove Claim (2.38) for 2 = B; in fact, 
we shall obtain the analogue of Theorem (2.44). A bit of notation: if u is 
a continuous function on B and 0 < r < 1, we define the function u, on S 
by 


(2.47) P(z,y)= 


ur(y) = u(ry). 
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(2.48) Theorem. 
If f € L'(S) and P is given by (2.47), set 


u(z)= [ Ple.w) sw) dotw) (2 € B). 


Then u is harmonic on B. If f is continuous, u extends continuously to B 
and u= f on S. If f € L?(S) (1 < p < 00), then u, — f in the L? norm 


asr—l. 


Proof: For each z € B, P(x,y) is a bounded function of y € S, so 
u(z) is well-defined for c € B. It is harmonic since P is harmonic in z 
(since G(x, y) is, or by a direct calculation). 

Next, we claim that 

i. fy P(z,y) do(y) = 1 for all z € B. 
ii. For any yo € S and any neighborhood V of yo in S, 


lim P(ryo,y) do(y) = 0. 
rol S\V 

(ii) is obvious, since {ryo — y|~" is bounded uniformly for 0 < r < 1 and 
y € S\V, and 1—|ryo]? = 1-1? — 0 as r -+ 1. To prove (i), we note that 
since P is harmonic in z, for 0 <r< 1 andy € S the mean value theorem 
implies that 


i <u,2)PO9)= if P(ry’,y) do(y/). 


But another application of Lemma (2.44) shows that P(ry’,y) = P(ry,y’), 
so with z= ry, we have 1 = f, P(z,y’) do(y/’). 

Now, suppose f is continuous, and hence uniformly continuous since S 
is compact. Given € > 0, choose 6 > 0 so small that |f(z) — f(y)| < «€ 
whenever |r — y| < 6, and set Vz = {y € S: |x — y| < 6}. Then, for any 
z€S andr <1, by (i) we have 


[f(z) — u(rz)| = 


[02 ~ Heresy) dew) 
< ef P(rz,y) do(y) + 2||flloo if P(rz, y) do(y). 


By (i), the first term on the right is less than e€, and by (ii), the second 
term is also less than ¢ if 1 — r is small enough. Hence u, — f uniformly 
as r —> 1, and it follows that u extends continuously to B with u = f on S. 
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Finally, suppose f € L?, 1 < p < oo. Given € > 0, choose g € C(S) 
with ||g — filp < €/3. Setting v(x) = [5 P(z,y)g(y) do(y), we have 


If — uellp < MF — gllp + Ig — erllp + lor — telly. 
The first term on the right is < ¢/3, and if 1—r is small enough the second 
term is < €/3 since uniform convergence implies Z? convergence on S. We 
claim that the linear mapping f — u, is bounded on L?(S) with norm < 1 
for all r, so the third term is also < €/3. This assertion follows from the 
generalized Young’s inequality (0.10), since |P| = P and 


[ Pore.) do(y) =1 = 0nP 0,» = f Pire,v)dote) 


by (i) and the mean value theorem. i] 


We now turn to the theory of spherical harmonics. 
Let Py be the space of homogeneous polynomials of degree k on R", 
and let 
H.= {PEP : AP =0}, 
Hy = {P|S: PE Hy}. 
That is, H;, is the space of homogeneous harmonic polynomials of degree k 
and Hy is the space of their restrictions to the unit sphere. The elements 
of Hy, are called spherical harmonics of degree k. The restriction map 
from Hy, to Hy, is an isomorphism; its inverse is the map Y — P where 
P(z) = |e|*Y(|z|"2). 
We denote by r? the function z — 57? 2? on R®, an element of Pp. 


(2.49) Proposition. 
P, = Hy Or? Py_o, where r?Py_2 = {r?P: P € Pe_o}. 


Proof: For P,Q € Px, let {P,Q} = P(A)Q; that is, if P(x) = S> agx? 
then {P,Q} = }0aa0°Q. The form {P,Q} is linear in P and conjugate- 
linear in Q, and it is scalar-valued: applying a derivative of degree k to a 
polynomial of degree k yields a number. Moreover, one readily checks that 
{x*, 2°} = a! if 8 = a and {z%, 2°} = 0 otherwise. Hence, in general, 


oe Agr”, Ss; bz? } = a alagbe, 
so the form {-,-} is a scalar product on Px. 
Now, notice that for any P € Py_2 and QE Py, 
{r? P,Q} = P(8)r?(8)Q = P(A)AQ = {P, AQ}. 
This immediately implies that H, is the orthogonal complement of r?P,_2 
with respect to {-,-}, which completes the proof. i 
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(2.50) Corollary. 
Pe= He Or? He-2 O *He-4 O° 


Proof: Induction on k. ! 


(2.51) Corollary. 
The restriction to the unit sphere of any element of P;, is a sum of spherical 
harmonics of degree at most k. 


Proof: r? =1on 5S. 1 


(2.52) Euler’s Lemma. 
IfQ € Py then )> 259; Q(x) = kQ(z). 


Proof: Exercise. | 


(2.53) Theorem. 
L7(S) = @o Ht, the expression on the right being an orthogonal direct 
sum with respect to the scalar product on L?(S). 


Proof: By Corollary (2.50) and the Weierstrass approximation theo- 
rem (which, by the way, we shall prove in §4A), the linear span of the H,’s 
is dense in L?(S). We must show that H; L Hy if j # k. Given Y; € Hy 
and Y, € Hy, let P; and P, be their harmonic extensions in H; and Hy. 
By Green’s identity (2.6), Euler’s lemma (2.52), and (0.1), 


o= f (RAP. PAP;) = f (ROPs—PrdP)=(b-) f PVPs 
=(k-j)(¥j|Ye). 0 


Remark: Since A commutes with rotations (Theorem (2.1)), the 
spaces H, are invariant under rotations, and one can show that they have 
no nontrivial invariant subspaces; see Exercise 8. Theorem (2.53) thus 
provides the decomposition of L?(S) into irreducible subspaces under the 
action of the rotation group. 


Let 
dy = dim H;, = dim Hi. 


For future purposes we shall need to compute dx. 
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(2.54) Proposition. 


; = (k+n-—1)! 
a arse 


Proof: Since the monomials r% with |a| = k are a basis for P,, dim Py 
is the number of ways we can choose an ordered n-tuple (a1, ..., @n) of non- 
negative integers whose sum is k. Think of it this way: we line up & black 
balls in a row and wish to divide them into n groups of consecutive balls 
with cardinalities a@1,...,Qn. To mark the division between two adjacent 
groups we interpose a white ball between two black balls; for this purpose 
we need n—1 white balls. The number of ways we can do this is the number 
of ways we can take k + n-— 1 black balls and choose n— 1 of them to be 


painted white, which is (k +n —1)!/k!(n — 1)!. t 
(2.55) Corollary. 
haku = eee 

Proof: By Proposition (2.49), dy = dim P, -- dim P;_2. 1 


(2.56) Corollary. 
dy = O(k"~?) as k + 0. 


Proof: d, is a polynomial of degree n — 2 in k. | 


Some remarks on the low-dimensional cases are in order. If n = 1, 
S consists of two points, and there are only two independent harmonic 
polynomials, 1 and x. 1 spans Ho and x spans Hy, and H, = {0} for 
k > 1. If n = 2, Ho is spanned by 1 and, for k > 0, Hy is spanned by 
(x1 +iz2)* and (21 —ix2)*. (These polynomials clearly belong to H;,, and 
dimH, = 2.) Thus if we ser x1 + izo = re’ and take @ as coordinate 
on the unit circle S, we see that H; is spanned by e'*? and e~**®. The 
decomposition of L?(S) into spherical harmonics is therefore just the usual 
Fourier series expansion. 

Back to the general case: For each z € S, consider the linear functional 
Y — Y(z) on Hy. Since Hy C L?(S) is a finite-dimensional Hilbert space, 
there is a unique Zf € H; such that 


Y(z)=(¥ |Z) (¥ € Hi). 


% is called the zonal harmonic of degree k with pole at x. We list 
some of the amusing proerties of Z{ in the following theorem. 
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(2.57) Theorem. 
Suppose z,y € S. 


a. For any orthonormal basis Y;,..., Ya, of Hz, Z{(y) = oe ¥;(z)Y;(y). 
b. Zf is real-valued, and Zj(y) = Z{ (zx). 

c. ZP*(Ty) = Zi (y) for any rotation T. 

d, Z2 (2) = dy/wn. 

e. |ZE113 = de/wn. 

f. ||Zélloo = dk /wn. 


Proof: (a) holds since 


Zi(y) = DZEIY)Ys(y) = 30 ZY) = OVD). 


Moreover, we can take the basis {Y;} to consist of real-valued functions, 
so (b) follows from (a). Next, if Y € H,, since do is rotation-invariant we 


have 
(v |22* oT) = y Y (y)ZF# (Ty) do(y) = I ¥(T~y) ZE* (y) do(y) 
Ss Ss 
= Y oT"'(Tz) = ¥(z). 


But ZZ is uniquely determined by the property (Y |Z{) = Y(z), so (c) 
follows. 


By taking z = y in (c) and noting that rotations act transitively on S, 
we see that ZZ(x) is independent of x. But by (a), Z7(2) = -|Y;(z)|?, so 


a = So INGlH = Sf Ce)? doe) = ff Ze (2) do(2) = wn Zi (2), 
which proves (d). (e) follows from (a), (d), and the Parseval equation: 
ZEB = OUZELYA)? = be = zee) = 
*inally, (f) follows from (d), (e) and the Schwarz inequality: for all y € S, 


= x= ca dx 
IZe(y)l = (Ze | Ze)| S Ze lbllZzlbe = —. ' 
n 


2.58) Theorem. 


7E(y) = Fi(z-y), where F, is an explicitly computable polynomial of 
legree k in one variable. 
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Proof: Let e = (1,0,...,0) € S, and let P € Hy be the harmonic 
extension of Zf; thus P(rz) = r* Z¢(z) for z € S andr > 0. From Theorem 
(2.55c) we see that P(Ty) = P(y) (y € RR") for any rotation T that leaves 
e fixed. Thus, if we set y’ = (y2,..., Yn) and write P(y) = ss yk? P;(y'), 
the polynomials P; are homogeneous of degree j and rotation-invariant 
on R®"-}_ But the only functions which are homogeneous and rotation- 
invariant are constants times powers of |y’|, and these are polynomials only 
if the power is a nonnegative even integer. Thus P;(y’) = C;ly'? where 
C; = 0 if j is odd, so 


{k/2] ea 
P(y) = D> Cayly' Pay”. 
0 


Now, P is harmonic, so by Proposition (2.2) and the identity A(fg) = 
(Af)g + 2Vf-Vg9+ f(Ag), 


0 = AP(y) 
[&/2] ' ; 
= Ory [25(24-+n—3)hy' PP 2UE- + (b-25)(b-25—- ly! Py 9] 
0 


{k/2]-1 
= DS [Oas(b-25)(6- 25-1) + Cay 42(25+2)(2)+n—1)] fy PI yf, 
Qo 


Thus 
(k — 2j)(k — 23-1), 


© (25+ 2)Q7+ 0-1)" 
so the C2;’s are determined by recursion once we know Co, and Co = 
P(e) = Zg(e) = dg/wa by Theorem (2.55d). 

If we restrict y to the unit sphere we have |y'|? = 1—y?, so 


C242 = 


[k/2} k k 
Ziv) = Yo Cad — vy = So djwk = Doyle, 
0 0 0 
where the b;’s can be computed from the C2;’s and the binomial theorem. 
Finally, given z € S, let T be a rotation such that Te = x. By Theorem 


(2.55c), 


ZE(y) = ZE(T~y) = Soj(e- Ty)’ = So 5y(Te «vi = So oy(z-v), 


which completes the proof. 1 
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The polynomials F(t) = )>6;t? are well known special functions. 
When n = 2, Fy is a constant times the Chebyshev polynomial of 
degree k; see Exercise 4. When n > 3, F; is a constant times the Gegen- 
bauer (or ultraspherical) polynomial ero ? of degree k associated 
to the parameter (n — 2)/2; see Exercise 6. For n = 3, ee is known as the 
Legendre polynomial of degree &k. For more about these polynomials, 
see the Bateman Manuscript Project [4] or Hochstadt [25]. 

Let 7, denote the orthogonal projection of L?(S) onto H,. The theory 
of zonal harmonics furnishes us with a simple formula for 7: 


(2.59) Proposition. 
If f € L(S), 


1 f(z) = | ile -wstw) do(y). 
Proof: We have f = ™ f+ g where g 1 Hy. Since Z2f € Hx, 
| Fe(z- y)f(u) do(y) = (F128) = (mf 1 ZE) = mef(2). 1 


It is clear how to solve the Dirichlet problem 
Au=0onB, u=fonS 


by spherical harmonics. If f is itself a spherical harmonic, u is just its 
extension as a harmonic polynomial. But every f is a sum of spherical 
harmonics, so we can extend by linearity. 


(2.60) Theorem. 
a. If f € L?(S), the harmonic function u on B with u = f on S is given 
by 


fos] 


u(rz) = Sort me f(z) (rxéS,0<r<l). 
0 
The series converges in L?(S) for each fixed r < 1, and absolutely and 
uniformly in y = rz forz € S andr <1ro for any ro < 1. 
b. The Poisson kernel (2.47) has the expansion 


co 


P(re,y)=SortFe(z-y) (2, ye S, 0S 7 <I). 
0 


The series converges uniformly for z,y € S and r < ro for any T9 < 1. 


104 Chapter 2 


Proof: (a) Clearly the series )> r*a, f converges in the L?(S) norm 
for r <1 and tends to f in this norm as r — 1. Moreover, since 


WZéll2 < (defn)? < C+ kr“? 
by (2.57e) and (2.56), for r < ro < 1 we have 
Do lr*mef(2)] S Doro F1ZE)| <llvll2 do roll Zélle 
S$ Cllfll2 92 751 + k)O-”? < 00, 
where we have used (2.59) and the Schwarz inequality. Thus the series 
converges absolutely and uniformly. Since mf € Hp, we have r*a,f € Hi, 


so )\r*m,f is harmonic by Corollary (2.12). 
(b) Ifr <ro, by (2.57f) and (2.52) we have 


Yi irt File - vl < Dor Zz(y)I < aS SOP nhl +k)? < 00, 


so the series )- r* F(x -y) converges absolutely and uniformly to a contin- 
uous function of y € S for each c € S and r < 1. Thus to complete the 
proof it suffices to show that for each z € S, r < 1, and f € C(S), 


[Poems dey) = [Sorte vs) dato). 


But this follows by interchanging the summation and integration and using 
(2.58) and part (a). | 


Spherical harmonics also lead to a solution of the dual Dirichlet problem 
(2.61) Au= fon B, u=OonS., 


The idea here is to find an orthonormal basis for L?(B) consisting of eigen- 
functions for A that vanish on S. 

We adopt the following convention: if Y € Hz, we regard Y as a 
function on R" \ {0} by extending it to be homogeneous of degree 0: 


Y(z) = Y(jz|7?2). 


(2.62) Lemma. 
IfY € Hy then AY = —k(k +n —2)r-7Y. 
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Proof: Since P(x) = |z|*Y(|z|~!x) = r*Y (x) is harmonic, by (2.2) 
we have 


0 = A(rFY) = (Ar*)Y 4+ 20(r*). VY + rFaY 
= [ee —1)rk-?2 + Pater] Y¥ + 2kr*-? S > 2j0;¥ + KAY 
= k(k+n—2)r*-2y + Fay, 
since ).2;0;Y = rdY/dr = 0 by (0.1). ' 


(2.63) Lemma. 
If Y € Hy and F(x) = f(|z|) = f(r), then 


n-1 - sete 
r 


A(FY)(2) = [P') Pa 2) str | ¥(z). 


Proof: As in the proof of Lemma (2.62), we have 


VF-VY = £o 


Y 
> 44) ¥(2) = fe = 0, 
so by (2.2) and (2.62), 


A(FY) = (AF)Y¥ +2VF-VY + FAY 


= [rm + ot - Ee] ve). i 


We propose to solve the eigenvalue problem 
Au = —A?u on B, u=OonS. 


(We call the eigenvalue —A? because it will turn out to be negative, but 
for the moment, A is just a complex number.) If we assume that u has the 
form u(x) = f(|z|)Y (2) where Y € H,, by Lemma (2.63) we are led to the 
ordinary differential equation 


k(k+n—2 
(264) p%(r) + 2A oy + fy? - AE HB= AD] ry 50. 
For boundary conditions, we have f(1) = 0, and we shall also require that 
u(x) = [r7* f(r)][r*Y(z)] be smooth at z = 0, ie., that r7* f(r) be smooth 
at r = 0. (The justification for this is that A + A? is hypoelliptic, as we 
shall show in §6B.) 
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(2.64) is essentially a Bessel equation. Indeed, if we set p = Ar and 
a(r) = f(r/A), so that g(p) = f(r) and g'(p) = A-}f"(r), then (2.64) 


becomes 
g''(p) + 7a) + [ me we g(p) = 9. 
Now set h(p) = p("-2)/2g(p). Substituting p?-™/2h(p) for g(p) and sim- 
plifying, we obtain 
(2.65) Wp) + 2H) + [i - hp) =0 


This is Bessel’s equation of order k + (n — on We shall assume the fol- 
lowing facts about solutions of this equation; see Folland [17] or Hochstadt 
[25] for more details. 

In the first place, the only solutions that are bounded at p = 0 are 
multiples of Jk4(n—2)/2(p), where 


ee a oD a 
Jolt) = Do ates TFA (5) 


is the Bessel function of the first kind of order a. Undoing our changes of 
variables, we see that 


<% = —1) \r\ 7 
f(r) = CrO-??2 7, g—ayy2(Ar) = C’r* » ered ot eee (¥) 


[k + = at cl 


jIT((n/2) +k +9) 


Thus r~* f(r) is analytic at r = 0, as desired, and f(1) = 0 if and only if 
Tk 4(n—2)/2(A) = 0. 
Moreover, the zeros of Jx4(n—2)/2 are all real, and since replacement of 
A by —A leaves f(r) unchanged, it suffices to consider the positive zeros. 
Let AL, Az,... be the positive zeros of J,4(n—2)/2, in increasing order, and 
let 
C= 2 Ie senpayAe 
Then 
{Ci Je t(n—2)/2(Abr) pb 1223553 J 
is an orthonormal basis for L?((0,1), rdr). 
Now, let 
Fir) = CerO Ty a (nay 2( Ak) 
Then {f/}?2, is an orthonormal basis for L?((0,1), r°~! dr). Moreover, 
let Y},..., Y,'* be an orthonormal basis for Hy, (as a subspace of L?(S)), 
and set 


Fe" (2) = fillel)¥e"(2). 
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(2.66) Theorem. 
{Fim > k > 0, 1> 1, 1 < m < dy} is an orthonormal basis for L?(B). 
Moreover, AF]/™ = —(AL)?F/™ and Fim(x) = 0 for z € S. 


Proof: The second assertion is obvious from the construction of F/™. 
As for orthonormality, by integrating in polar coordinates we have 


eee On aes paw 
| mere = [hoot ar | ve @ rr @ dots). 
B 0 Ss 


The integral over S equals dm: 6x¢4/, and in particular is zero unless k = k’, 
in which case the integral in r equals dy. 
To show completeness, suppose g € L?(B) is orthogonal to all F/™: 


1 
i ‘| fi(r) YE" (2)g(rz)r"—! do(x) dr = 0 for all k, l,m. 
o Js 


Integrating first over S, by completeness of spherical harmonics we obtain 


1 
fi(r)g(rz)r"~! dr = 0 
0 


for almost every  € S. But then for each such x we have g(rz) = 0 for 
almost every r. Thus g = 0 a.e., and the proof is complete. i] 


We can now solve the Dirichlet problem (2.61) for f € L?(B). Namely: 
if f= Soa Fy”, take u = — (A) 77a" Fy”. 


Since NCP) sn —2y/2(A) is an entire analytic function, its zeros 1, are 
bounded away from 0, so the series for u converges in L?(B), and it follows 
easily that Au = f in the sense of distributions. It will follow from our 
work in Chapter 7 that the series for u actually converges in the norm of 
Hi(B) as defined in §2F (indeed, in an even stronger norm), so that u = 0 
on S in the sense of Corollary (2.40). Moreover, we will show in Chapter 
7 that if f € C*(B) where k > in, then u € C?(B), so u is a classical 
solution of the Dirichlet problem (2.61). (Actually, it would suffice to have 
f € C%(B) for some a > 0; see the remarks in §7G.) 

We conclude by remarking that the same method leads to a solution of 
the Neumann problem 


(2.67) Au=f on B, é,u=0onS 
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by expansion in spherical harmonics and Bessel functions. We simply re- 
place the boundary condition f(1) = 0 for (2.64) by f‘(1) = 0; the corre- 
sponding boundary condition for the Bessel equation (2.65) is 


5(2 — m) Jk +(n—2)/2() + Db 4(n—2)/2(A) = 9. 
If pl, u?,... are the positive numbers satisfying this equation, and 
ky By 


Als v2 uh, 
* GY? = FF = k(n = 2) legen ayra 

then {Aj Je4(n—2)/2(4,")} P21 is an orthonormal basis for L?((0,1), rdr) 
when k > 0. If k = 0, we must also include the function n}/2p(2-")/2 
corresponding to the eigenvalue 4 = 0. (See Folland [17].) The rest of 
the discussion goes through as before: we obtain an orthonormal basis for 
L?(B) consisting of eigenfunctions of A satisfying the Neumann condition 
0,u = 0. By expanding f € L?(B) in terms of this basis, we obtain a 
solution of (2.67) provided that the component of f corresponding to the 
eigenvalue \ = 0 vanishes, that is, provided i f = 0. (As we remarked in 
§2C, this condition is necessary for the solvability of (2.67) in any case.) 


EXERCISES 
1. Show that the Poisson kernel for the ball Br(zo) is 


R? — |z — z0|? 


Pe onRle— ol 


2. Suppose 2 is an open set in R". Show that for any compact sets 
K,K' C Qwith K contained in the interior of K’ and any multi-index 
a there is a constant Cx, such that 


sup |d%u(z)| < Cx, sup |u(z)| 
rek z€K' 


for every harmonic function u on 2. (Hint: First do the case K = 
B, (to), K' = Br(xo), where r < R, by applying the analogue of Theo- 
rem (2.48) for the ball Bp(zo); see Exercise 1.) Conclude that if {uj} is 
a sequence of harmonic functions on 2 that converges to u uniformly on 
compact subsets of 2, then 0%u; —+ 0% u uniformly on compact subsets 
of 2 for every a. 
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. (Harnack’s inequality) Suppose u is continuous on Br(zq), harmonic 
on Br(zo), and u > 0. Show that for jz — zo =r< R, 
1 — (r/R) 
[1+ (r/R)]"~? 


1+ (r/R) 
[1 —(r/R)]"-} 


(Hint: Estimate the Poisson kernel in Exercise 1 by a constant.) 


u(t) < u(z) < 


u(Zo). 


. Show that if we identify R? with C, the 2-dimensional zonal harmonics 
are given by 


Ze" (e*) = (Qn)7!, Ze" (e#*) = 2 cosk($ — 8) for k > 0. 


Verify Theorem (2.57) from these formulas, and show that the poly- 
nomial F, of Theorem (2.58) is x-1T, for k > 0, where 7} is the 
Chebyshev polynomial defined by 7;.(cos 6) = cos k6. 


. If we identify R? with C, (2.47) and Theorem (2.60b) together with 
Exercise 4 show that for n = 2, 


eos, SE = 0 i¢ ok 
1 — 2rcos(@ — ¢) + r? = 2nP(re’,e a hg cos k(@ — ¢). 


Verify this directly by summing the series. (Hint: cos k@ = Ree'®.) 


. The Gegenbauer polynomials C} associated to the parameter \ > 0 are 
defined by the generating relation 


Do CRO)r* = (1 = 2rt + 7). 
it] 


By applying the operator 1 +-A~!r(d/dr) to both sides of this equation 
and using Theorem (2.60b), show that if Fy is as in Theorem (2.58) 
and n > 3, 
2k+n—2_“n—2)/2 
t) = ——_—_—_ t). 
Fy (t) Geos! (t) 


. Solve the Neumann problem 
Au=0on B, OQu=gonS 


by expanding g in spherical harmonics. How is the necessary condition 
Is 9 = 0 used? 
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8. Suppose V is a nonzero vector subspace of H, that is invariant under 
rotations. Show that V = H,. (Hint: Consider the “zonal harmonic” 
Z¥, i.e., the unique element of V such that Y(z) = (Y |Z?) for all 
Y €V. Z§ has properties analogous to Theorem (2.57), and the proof 
of Theorem (2.58) shows that a function with these properties must be 
a constant multiple of 27.) 


9. Show that J_y/2(t) = /2/at cost and J,/2(t) = 1/2/at sint, and then 
show that the orthonormal basis for L?(—1, 1) given by Theorem (2.66) 
in the case n = 1 is 


{cos 4jra ij= 1,3,5,...} U {sin dja :j = 2,4,6,...}. 


Note that if we make the change of variable t = }(z + 1), this basis 


turns into the familiar Fourier sine basis {sinjmé : j = 1,2,3,...} for 
L?(0,1). 


I. More about Harmonic Functions 


Now that we have solved the Dirichlet problem for the ball, we can derive 
some more interesting facts about harmonic functions. 


(2.68) The Reflection Principle. 

Let 0 be an open set in R"*! (with coordinates r € R", t € R) with the 
property that (z,—-t) € Q if (z,t)EQ. Let Q4 = {(z,t) EN:t > 0} and 
Qo = {(z,t) € Q:t =O}. If u is continuous on N4 UN, harmonic on D,, 
and zero on Qo, then u can be extended to be harmonic on 2) by setting 
u(x, —t) = —u(z,t). 


Proof: It is clear that this extension of u is continuous on 2 and 
harmonic on 2 \ Qo. Given (x9, 0) € Qo, we shall show that u is harmonic 
near 29. Let B be a ball centered at tq whose closure is contained in 2. 
By translating and dilating the coordinates (which preserves harmonicity), 
we may assume that ro = 0 and B is the unit ball. Since u is continuous 
on B, we can solve the Dirichlet problem 


Av=0onB, v=uon OB, 


with v € C(B), by Theorem (2.48). By the explicit formula given there 
for v, the fact that u(z,—t) = —u(z,t) implies that v(z, -t) = —v(z,t). 
In particular, v(z,0) = 0. Thus v agrees with u on the boundaries of the 
upper and lower halves of B. By the uniqueness theorem (2.15), v = u on 
each half, so v = u on B. In particular, u is harmonic on B. i 
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Suppose 2 is a neighborhood of zo € R". If u is a harmonic function 
on 2.\ {zo}, u is said to have a removable singularity at zo if u can be 
defined at zo so as to be harmonic on 2. The following theorem says that 
any singularity which is weaker than that of the fundamental solution is 
removable. 


(2.69) Theorem. 
Suppose u is harmonic on 2 \ {xo}. If 


|u(x)| = o(lz—2o/?-") (n> 2) 


or 
u(x)| = o(log|z—o[~*) — (n = 2) 


as Z—+ Zo, then u has a removable singularity at xo. 


Proof: By translating and dilating the coordinates, we may assume 
that to = 0 and that Q contains the closed unit ball B,. (We shall write 
B, for B,(0).) We may also assume that u is real. Since u is continuous 
on OB,, by Theorem (2.48) there exists v € C(B;) satisfying 


Av=0 on B,, v=uon OB. 


We claim that u = v on B, \ {0}, so we can remove the singularity by 
setting u(0) = v(0). Given e > 0 and 6 € (0, 1), consider the function 


u(x) — v(x) —e(|e?-"-1) (n> 2), 
u(x) — v(x) + €log |x| (n = 2) 


on B,\ Bs. This function is real and harmonic on B; \ Bs (Corollary (2.3)), 
continuous on the closure, zero on 0B,, and — by the assumption on u —— 
negative on OB; for all sufficiently small 6. By the maximum principle, it 
is negative on By \ {0}. Letting « + 0, we see that u—v <0 on B, \ {0}. 
By the same argument, v — u < 0 on B; \ {0}, so that u = v on B, \ {0}. 


Our final results concern the behavior of harmonic functions at infin- 
ity. To obtain these, we first need a formula for A in general curvilinear 
coordinates, which is of interest in its own right. 

Let T be a C™ bijection from an open set 2 C R” to an open set 
OY Cc R* with C™ inverse. Let y = T(x), and let Jp = (Oy;/Ax;) and 
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Jp-1 = (x;/dy;) be the Jacobian matrices of T and T~!. Define the 
matrix (g;;) on 2! by 


o = 1)t ~1 = sD 
(9ij(y)) = (Jr-)'(y) Jr-1(y) (x: dy i) 


i 


Then the volume elements on 2 and are related by dz = | det Jp-1|dy = 


Vg dy. 


(2.70) Theorem. 
If u is a C? function on Q and U = ae then 


(2.71) AuoT™! Zoe 2 (ai va5e). 


Moe 


The inverse matrix (g'/) of (gi;) is then 


(94 (y)) = (J J' (TA 'y)) = (= se 
k 


Moreover, let us set 


g = det(9;;) = (det Jp). 


Proof: Given w € oe let W = woT!, Then 


7 BU Oy OW ay; 
_ y J Oy; Ox, Oy; Be, V9 


t,j,k 
aU AW ,, 

= —— ——~ gy") = tj 

Sf am But EMD gy (ag) 

; ~oV 
ea éj ae 
=f (ae (2 V9 by )I 7) wie 
This being true for all w, the result follows. | 


Remark: Rather than regarding y = T(x) as a transformation from 
Q to 0, we can regard y,..., Yn as corvilinear coordinates on 2, and the 
expression on the right of (2.71) gives the formula for the Laplacian of 
a function U in these coordinates. More generally, this is the expression 
for the Laplace-Beltrami operator on a Riemannian manifold with metric 
tensor (gij) in the local coordinates y1,..., Yn- 
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We are particularly interested in the transformation 
y= T(2) = [ele 
on R" \ {0} obtained by inversion in the unit sphere. Since z = |y|~?y, we 
see that 


Ore _ —2yiye , Sik 
Oy lyl* lyl?’ 


—2yiy, , Sik [4 Yk a] bij 
943 = +t + Hk] = 4, 
: > | lyl* lyl?J i tlt = yl?y lylt 
so g = |yl*5;; and g = [y|~4". Thus, if u is a C? function on R" \ {0} and 
U(y) = u(lyl77y), 


a OU 
—2,) _ an 4-2n 
Asu(lul-?4) = ll?* 3 gem [lat 57] 


o’U aU 
_ 2n 4~2n ans 2-2n,,. 
= |yl?" >> ‘ a + (4 — 2n)ly|?-7"y; aa 
a dly|?-" aU 
n+2 yl? nou 


We can add the term Paes to the last expression in square 
brackets without changing anything, since )> 0”|y|?~"/dy? = 0 for y # 0. 
We therefore have 


2 
(2.72) Au(|yl7?y) = lyl"*? So sal"), U(y) = u(ly|~7y). 
J 


If Qc R” \ {0}, we set 
Q= {|z|"?2 :2 EQ}, 


and if uis a function on Q, we define its Kelvin transform 7, a function 
on Q, by 
(x) = [x|?-Pu(|2|722). 


(We have already encountered this in the construction of the Green’s func- 
tion for the ball in §2H.) With the notation of (2.72), we have U(y) = 
ly|"~2u(y), so if we replace u by @ in (2.72) we obtain 


Ai(|y|~?y) = lyl"F? Auly). 


In particular, we have proved: 
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(2.73) Theorem. i 
If u is harmonic on 2 C RK" \ {0}, its Kelvin transform @ is harmonic on Q. 


Now suppose u is harmonic outside some bounded set; then its Kelvin 
transform @ is harmonic in a punctured neighborhood of the origin. We 
say that u is harmonic at infinity if @ has a removable singularity at 0. 
As an immediate corollary of Theorem (2.69), we have: 


(2.74) Proposition. 

If u is harmonic on the complement of a bounded set in R", the following 
are equivalent: 

a. u is harmonic at infinity. 

b. u(x) 3 0 as & + 00 ifn > 2, or |u(x)| = o(log |x|) as x + 00 ifn = 2. 
c. ju(x)| = O(|z|?-") as 2 + 00. 


For future reference we give one more result concerning the behavior of 
harmonic functions at infinity. We denote by 0, the radial derivative, that. 
is, the normal derivative on spheres about the origin: 


d 
Apu(ry) = su(ry) = Do yjiu(ry) — (r > 0, |yl = 1). 
(2.75) Proposition. 
Suppose u is harmonic at infinity. Then |0,u(x)| = O(|z|!~") as 2 — 00; 
moreover, in case n = 2, |0,u(x)| = O(|z|~?) as x — 00. 
Proof: By dilating the coordinates, we may assume that u is har- 
monic outside Brp(0) for some R < 1. The Kelvin transform w is then 
harmonic on the unit ball B,(0) (once we have removed the singularity at 


0) and continuous on its closure. We can therefore expand @ in spherical 
harmonics according to Theorem (2.60): 


(x) = "SV lel*Ye(lel-*2) (Ye € He). 
0 
But then 
oO 
u(z) = |2[?-Pai(|2|-2z) = > |x|?“ FY; (e172). 
0 
Thus if we set x = ry with r > 0 and |y| = 1, 


u(z) = Dor? "FV; (y), 
0 
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so that 
foe] oO 

(2.76) d,u(z) = So (2-n—k)r-? FY; (y) = yin Yo(2—-n—k)r* Vi (y). 
0 0 

If r > 3, say, then certainly 


[e-n- br*¥e(| <2, 


which is bounded independent of r and y since the series )>27*Y,(y) = 
(dy) is absolutely and uniformly convergent in y. Thus [0,u(ry)| = 
O(r!~"). Moreover, if n = 2, the term with k = 0 in the series (2.76) 
vanishes, so the same argument yields |0,u(ry)| = O(r~?). ' 


EXERCISES 


1. Suppose u and v are bounded and harmonic on the half-space Ro 


and continuous on its closure, and that u(#,0) = v(z,0) for c € R”. 
Show that u = v. (Hint: Consider w = u — v.) 


2. Use Theorem (2.70) to calculate the Laplacian in polar coordinates on 
R? and in spherical coordinates on R°. 


3. Let F be a one-to-one holomorphic function on an open set 2 Cc C. 
Regarding 7 = F-! as a map from F(Q) C R? to 2 C R? as in 
Theorem (2.70), show that the associated matrix (gij) is given by 
9i3(z) = |F"(z)|?6;;, and conclude that if u is a C? function on F(Q), 
A(uo F) = |F’|?(Au) o F. 


Chapter 3 
LAYER POTENTIALS 


This chapter is devoted to the solution of the Dirichlet and Neumann prob- 
lems for the Laplacian by the method of layer potentials. This method 
reduces the problems to solving certain integral equations, for which one 
can use the theory of compact operators. 


A. The Setup 


In this chapter Q will be a fixed bounded domain in R" with C? bound- 
ary S, and we set 2’ = R™\Q. Q and ’ will both be allowed to be 
disconnected; however, since S is differentiable there can only be finitely 
many components. (In practice one usually wants to consider connected 
domains, but it is of interest to allow them to have holes, i.e., to allow their 
complements to be disconnected; and as we shall see, the theory is quite 
symmetric with respect to Q and 9'.) We denote the connected compo- 
nents of Q by Q),...,Qm and those of 2’ by 05,94...,01,,, where 0 is 
the unbounded component. 

To deal with the Neumann problem we need to be careful about the 
meaning of the normal derivative, since we don’t want to clutter up the 
theory with extraneous smoothness assumptions. Recall that we have de- 
fined the normal derivative 0, on a neighborhood of S by formula (0.3). 
We define C,() to be the space of functions u € C+(Q)M C() such that 
the limit 

6,u(z) = cli, v(x): Vu(2 + ty(z)) 
exists for each z € S, the convergence being uniform on S. (Thus 0,_u is 
a continuous function on S.) Similarly, we define C,(') to be the space 


Layer Potentials 117 
of functions u € C1(Q’) N C(Q’) such that the limit 


Oy+u(z) = iM, v(x). Vu(x + tyv(z)) 
exists for each x € S, the convergence being uniform on S. The operators 
0,.. and @,4. are called the interior and exterior normal derivatives 
on S. 


We can now state precisely the problems we propose to solve: 


The Interior Dirichlet Problem: Given f € C(S), find u € C(®) 
such that u is harmonic on Q and u =f on S. 


The Exterior Dirichlet Problem: Given f € C(S), find u € C(Q') 
such that u is harmonic on 2! U {oo} and u= f on S. 


The Interior Neumann Problem: Given f € C(S), find u € C,(Q) 
such that u is harmonic on Q and 0,_u= f on S. 


The Exterior Neumann Problem: Given f € C(S), find u € 
C,(") such that u is harmonic on Q’ U {co} and 0,4u = f on S. 


Note that for the exterior problems we require the solution to be har- 
monic at infinity as discussed in §2I; the reason for this is to obtain unique- 
ness results. Note also that the derivative 0,4 for the exterior Neumann 
problem is taken along the inward-pointing normal to 12’; this amounts to 
replacing f by —f if we want the outward normal. 

These four problems are intimately connected with each other, and we 
shall obtain the solutions to all of them simultaneously. To begin with, we 
prove the uniqueness theorems for all four problems. 


(3.1) Proposition. 
If u solves the interior Dirichlet problem with f = 0, then u= 0. 


Proof: This is just the uniqueness theorem (2.15). | 


(3.2) Proposition. 
If u solves the exterior Dirichlet problem with f = 0, then u=0. 


Proof: We may assume that 0 ¢ 0. By Theorem (2.73), the Kelvin 
transform t of u solves the interior Dirichlet problem with f = 0 for the 
bounded domain ! = {z : |z|~?z € 0'}. Hence & = 0, so u = 0. 1 


118 Chapter 3 


(3.3) Proposition. 
If u solves the interior Neumann problem with f = 0, then u is constant 
on each component of 2. 


Proof: By Green’s identity (2.5), 


fivur=- f u(Au)+ f udu = 0. 
2 a s 


Thus Vu = 0 on Q, so u is locally constant on 2. | 


Remark: In this proof, as well as the following ones, the use of Green’s 
identity is not quite obvious since u is not assumed to be in C1(Q). How- 
ever, it is easily justified by replacing Q by the domain 2; whose boundary 
is 

Sp= {x + tv(2) :2E s} 
and passing to the limit as t — 0 from below or above, as appropriate. The 
definitions of 0,. and 0,4 are designed precisely to make this argument 
work. 


(3.4) Proposition. 

If u solves the exterior Neumann problem with f = 0, then u is constant on 
each component of 2', and u = 0 on the unbounded component 24 when 
n> 2. 


Proof: Let r > 0 be large enough so that 2 C B, = B,(0). By 
Green’s identity (2.5), 


| |Vul? = -| wau) — f wdaut | udu 
Br\Q B,\O Ss aB, 


= | ud, 4, 
OB, 


where 0, denotes the radial derivative. Since ju(x)| = O(|z|?~") and 
{0,u(x)| = O(|z|!~") by Propositions (2.74) and (2.75), we have 


7 ud-u 
OB, 


When n > 2, by letting r — 00 we obtain f,,, [Vu|? = 0. Thus Vu = 0 on 
0, so u is locally constant on 0’, and u = 0 on 4 since |u(x)| = O{|z|?-"). 
If n = 2, Proposition (2.75) gives |0,u(x)| = O(r~7), so the same argument 
shows that | f,,,. u0-u] = O(r-') and hence that u is locally constant on 
Q, 1 


< CP 1<C’r?-*, 
oB, 
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We shall see that the interior and exterior Dirichlet problems are always 
solvable. For the Neumann problems, however, there are some necessary 
conditions. 


(3.5) Proposition. 
If the interior Neumann problem has a solution, then [,, f = 0 for j 
1,...,m. 


Proof: This follows immediately from Corollary (2.7). 1 


(3.6) Proposition. 
If the exterior Neumann problem has a solution, then Son f =0 forj 
Pi 


1,...,m’, and also for j = 0 in case n = 2. 


Proof: That f,,, f = 0 for j > 1 follows from Corollary (2.7). If 
n = 2, let r be large enough so that 2 C B, = B,(0). Then Corollary (2.7) 


gives 
| O-u — O,4u= 0. 
aB, an, 


But |0,u(x)| = O(|z|~2) by Proposition (2.75), so the first term on the 
right vanishes as r — 00; since 0,4u = f, we are done. ' 


We now turn to the problem of finding solutions. To begin with, con- 
sider the interior Dirichlet problem. Our inspiration comes from the for- 
mulas (2.24) and (2.37) that represent a harmonic function in terms of its 
boundary values. Suppose we neglect the second term in (2.24) or the dif- 
ference between G and N in (2.37) and try to solve the interior Dirichlet 
problem by setting 


u(z) = if 8,,N(2,u)s(u) do(y), 


where N is the fundamental solution for A defined by (2.18) and (2.22). 
u will be harmonic in ©, but of course it will not have the right boundary 
values in general. However, in a sense it is not far wrong: we shall see that 
ulS = if + Tf where T is a compact operator on L?(S). Thus what we 
really want is to take 


(3.7) seh [ dv, N(2, v)(y) do(y) 
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where i +T¢ = f, and we can use the theory of compact operators to 
handle the latter equation. 

The function u defined by (3.7) is called the double layer potential 
with moment ¢, and its physical interpretation is as follows. If we think 
of the normal derivative 0,,N(z,y) as a limit of difference quotients, we 
see that (3.7) is the limit as t + 0 of the potential induced by a charge 
distribution with density t~1¢(y) on S together with a charge distribution 
with density —t~1¢(y) on the parallel surface S; = {y+tv(y):y € S}. In 
other words, (3.7) is the potential induced by a distribution of dipoles on S 
with density ¢(y), the axes of the dipoles being normal to S. (See Exercise 
2 of §2G for the analogous result for a half-space. There, the Poisson kernel 
is 20,,N(z,y) and the operator T is zero.) 

Similarly, we shall try to find a solution to the Neumann problem in 
the form 


u(z) = i N(2,9)6(y) do(y). 


This is the single layer potential with moment ¢. It is simply the 
potential induced by a charge distribution on S with density —¢(y). 


B. Integral Operators 


Before studying double and single layer potentials, we need to collect some 
facts about certain kinds of integral operators on the boundary S of our 
domain 2 C R"”. (These results also hold if S is the closure of a bounded 
domain in R"~?.) 

Let K be a measurable function on S x S, and suppose 0 <a <n—1. 
We shall call K a kernel of order a if 


(3.8) K(z,y) = A(z, y)|z — yl“ 


where A is a bounded function on S x S. We shall call K a kernel of 
order zero if 


(3.9) K(z,y) = A(z, y) log |z — yl + B(z, y) 


where A and B are bounded functions on S x S. We note that it is im- 
material whether we measure |x — y| in the ambient space or in local co- 
ordinates on S, since the two quantities have the same order of magnitude 
as x —y — 0. Finally, we shail call K a continuous kernel of order 
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a (0 <a@ < n-—1) if K is a kernel of order a and K is continuous on 
{(z,y)ESx Siz Fy}. 

If K is a kernel of order a, 0 < a < n— 1, we define the operator Tx 
formally by 


Tx f(a) = j. K(2,y)f(u) do(y). 


(3.10) Proposition. 

If K is a kernel of order a, 0 < a < n—1, then Tx is bounded on L?(S) 
for 1 < p < 00. Moreover, there is a constant C' > 0 depending only on a 
such that if K is supported in {(z,y) : |x — y| < e}, then 


ITxfllp $ Ce"? *[]Alloollfllp — (@ > 0), 
IITKfllp < Ce" (l]Alloo(1 + |loge|) + []Bllco)IIflloo (a = 0), 


where A and B are as in (3.8) and (3.9). 


Proof: It suffices to prove the second statement, since we can always 
take € to be the diameter of S. Using polar coordinates on S centered at 
xz € S, we see that for a > 0, 


/ (ev) do(v) <All [Je vl dy 


zn~yl|<e 


€ 
S CillAlloo ff r"-?-* dr = CallAlloe" 1°. 
0 


Similarly, f |K (x, y)| do(z) < C2||Alloe"~!~%. The same calculation shows 
that fora = 0, f|K(z,y)|do(y) and f |K(zx,y)|do(z) are dominated by 
"1 (|{ Alloo(1 + [log e|) + ||Bllo). The proposition therefore follows from 
the generalized Young’s inequality (0.10). i] 


(3.11) Proposition. 
If K is a kernel of order a, 0 <a <n-—1, then Tx is compact on L?(S). 


Proof: Given « > 0, set K.(z,y) = K(z,y) if |c — y]l > ©€ and 
K.(z,y) = 0 otherwise, and set Ki = K ~— K,. Then K, is bounded 
on S x S, hence is a Hilbert-Schmidt kernel, so Tx, is bounded on L?(S) 
by Theorem (0.45). On the other hand, by Proposition (3.11) the operator 
norm of Tk ~ Tx, = Tx: tends to zero as € — 0, so Tx is compact by 
Theorem (0.34). 1 
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(3.12) Proposition. 
If K is a continuous kernel of order a,0<a<n-—1, then Tx transforms 
bounded functions into continuous functions. 


Proof: We may assume a > 0, since a continuous kernel of order zero 
is also a continuous kernel of order a for any a > 0. Thus we may write 
K as in (3.8). Given x € S and 6 > 0, set Bs = {yE S: |z— y| < 4}. If 
y € Bs, we have 


Tk fa) — Tk f(v)| = | [t80e,2)— Ku 2/0) doe) 
< a (K(x, 2) + 1K(y, 22) do(2) 
+f IK(e,2)— Kalle do(2), 
S\ Bas 
The integral over Bzs is bounded by 
NAllotMo fle = 21-* + by - 21°} do(2), 
Brus 


and an integration in polar coordinates shows that this is O(6"~!~%). Given 
€ > 0, then, we can make this term less than de by choosing 6 sufficiently 
small. On the other hand for y € Bs and z € S \ Bos we have |z — z| > 
26 and |y —z| > 5, so the continuity of K off the diagonal implies that 
K(a,z)— K(y,z) converges to 0 uniformly in z € S \ Bas as y — x. Hence 
the integral over S \ Bos will be < he if y is sufficiently close to z. i] 


It is convenient to consider J’ as an operator on L?(S) because L?(S) is 
a Hilbert space. However, we really want to deal with continuous functions. 
The following proposition assures us that when we solve the Fredholm 
equation u+ Thu = f, continuous data give us continuous solutions. 


(3.13) Proposition. 
Suppose K is a continuous kernel of order a,0< a<n-—1. Ifué L?(S) 
and u+ Txu € C(S), then u € C(S). 


Proof: Given € > 0, choose ¢ € C(S x S) such that 0 < ¢ < 1, 


¢(z,y) = 1 for |x — y| < fe, and ¢(z,y) = 0 for |z— y| > €. Set Ko = 6K 
and K, = (1— ¢)K. Then by the Schwarz inequality, 


1/2 
ITe,u(2) — Trsy)| tl | IKa(2, 2) ~ Kay,2)P?do(2) 
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Since K, is continuous, the integral on the right tends to zero as y — Z; 
thus Tk, u is continuous. 

Now if we set g = (u+ Txu) — Tx,u, we see that g is continuous 
and that u+ 7Tx,u = g. By Proposition (3.11), if € is sufficiently small 
the operator norm of Tx, on both L? and L® will be less than 1. Then 
I+ Tx, is invertible, and u is expressed in terms of g by the geometric 
series u =), (—Tx,)/g. By Proposition (3.12), each term in the series is 
continuous, and the series converges in the norm of L™, i.e., uniformly, so 
u is continuous. | 


EXERCISES 


1. Suppose K € C(S x S). Show that Tx maps L?(S) into C(S) for 
1 < p < oo. Show also that if {f;} is a bounded sequence in L?(S) 
then {Tx f;} has a uniformly convergent subsequence; in particular, Tx 
is a compact operator on L?(S). (Use the Arzela-Ascoli theorem.) 


2. Show that if K is a continuous kernel of order a on S,O< a<n-—1, 
then Tx is a compact operator on L?(S) for 1 < p < oo. (Use Exercise 
1.) 

3. Extend Proposition (3.12) to show that if K is a continuous kernel of 
order a then Tx maps L?(S) into C(S) for p> (n—1)/(n—1- a). 


C. Double Layer Potentials 


Let ¢ be a continuous function on S. In the section we study the properties 
of the double layer potential with moment ¢, 


(3.14) u(2)= f a, W(z,v)e(v)do(v)  (@ ERS). 
s 
To begin with, we note that forzg € R"\S andyeS, 


_ _(2-y) u(y) 
Ov, N(z,y) = ar Pp 
is continuous in y and harmonic in z (since z-derivatives commute with 
y-derivatives), and it is O(|z['~") as z — oo. Therefore, u is harmonic 
on R"\ S and |u(z)| = O(|z|!-") as 2 — ov, so that u is also harmonic 
at infinity. More interesting and more subtle is the behavior of u near S. 
Before investigating this, we state a technical lemma. 
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(3.15) Lemma. 
There is a constant c > 0 such that for all z,y € S, 


(2 —y)-v(y)| < elz— y)?. 


Proof: Since |(z— y)-v(y)| < |z— y| for all z, y, it suffices to assume 
that [z— y| < 1. Given y € S, by a translation and rotation of coordinates 
we may assume that y = 0 and v(y) = (0,...,0,1). Hence (z — y) -v(y) = 
Z,, and near y, S is the graph of an equation zy = f(z1,...,%n-1) where 
f €C’, f(0) =0, and Vf(0) = 0. By Taylor’s theorem, then, 


Kz —y) -v(y)l = If(zi,-.-,2n-a)I S el(e1,-.-, tar)? < elz[? = efx — yl? 
for |z| < 1, where c depends only on a bound for the second partial deriva- 


tives of f. Since S is compact and of class C?, there is such a bound that 
holds for all y € S, and we are done. I 


We shall give a special name to the kernel 0,, N(z, y) when z and y are 
both on S; namely, we set 


(3.16) K(z,y)=O,,N(z,y) (ES, yES, Fy). 

The reason for this bit of notation is twofold. First, the kernel K is going 
to play a special role in our theory. Second, it is a little dangerous to 
regard K as really the normal derivative of N for z € S; there are some 


delta-functions lurking in the shadows, as we shall see at the end of this 
section. 


(3.17) Proposition. 
K is a continuous kernel of order n — 2 on S. 


Proof: We have 
A(z, y) _ _(—y)-(y) 
K(z,y) = je- gi" where A(z, y) = te Sy 
K(z,y) is clearly continuous for z # y, so the result follows from Lemma 
(3.15). i 


It is therefore reasonable to extend the potential u to S by setting 
(3.18) ulz)= [ K(2v)6y)do(v) = Tele) (2 €5). 


By Proposition (3.12), the restriction of u to S is continuous on S. However, 
u is not continuous on R”: there is a jump when we approach points on 
S by points in R"\ S. This phenomenon shows up most clearly in the 
simplest case, where ¢ = 1. 
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(3.19) Proposition. 


_fl ifzen, 
[amendw={5 HES 


| K(z,y)do(y) = ; ifze S. 
s 


Proof: The result for z € 2’ follows immediately from Corollary 
(2.7), since N(z,y) is C% on © and harmonic on 2 as a function of y 
when z € 92’. On the other hand, if z € Q, let € > 0 be small enough so 
that B, = B,(z) C 2. We can then apply Corollary (2.7) to N(z, -) on the 
domain 2 \ B, as in the proof of the mean value theorem: 


I-n 
0= J %.N(2,» doy) - : is do(y) 


Wy 


— [ Nev) dow) -1. 
Now suppose z € S, and again let B, = B,(z) Set 
S.=S\(SNB), OBL=8BeNQ, IBY = {ye OB, : v(z)-y < Of. 


(Thus 0B?’ is the hemisphere of 0B, lying on the same side of the tangent 
plane to S at z as 2.) On the one hand, clearly 


[ K@vidotw) = him [ _K (au) doty). 


On the other hand, since N(z,-) is harmonic on 2 \ B, and smooth up to 
the boundary S, U0B!, by Corollary (2.7) we have 


0= i K(2, y) do(y) + iz Ov, N(x, y) da(y). 


Thus, taking into account the proper orientation on 0B/, 
l—-n 
€ 


J Ke.) do(u) = — tim ff a,,0 Ce.) day) = ling J aaah. 
s 0 Jost c—-0 Wn Jap! 


But since S is C?, the symmetric difference between 0B) and OB% is con- 
tained in an “equatorial strip” 


{y€ OB. :|y-v(z)|<ele)}, — e(e) = OL”), 
whose area is O(e"). Hence 


| do(y) = | do(y) + O(6") = LeMay + O(6"), 
aB! aBe 


and the result follows. | 
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To extend this result to general densities ¢, we need two preliminary 
lemmas. 


(3.20) Lemma. 
There is a constant C < oo such that for allx ER"\S, 


i la,,N (2, y)|do(y) <C. 


Proof: Let dist(x,S) denote the distance from « to the nearest point 
of S. Fix 6 > 0 with the following two properties: (i) 6 < 1/(2c) where c is 
the constant in Lemma (3.15); (ii) the set of x such that dist(z,S) < $6 isa 
tubular neighborhood of S as in Proposition (0.2). Thus, if dist(x, S) < 46, 
there is a unique zp € S and t € (-26, 36) such that x = zo +tv(z9); Zo 
is the closest point in S to x by the Lagrange multiplier principle. 

Case I: dist(z,S) > 46. Then |d,,N(z,y)| < C16'~" for all y € S, so 


i lav, N(x, wi do(y) < C16!" | de acy 
s Ss 


Case If: dist(z,S) < $6. Let xp be the unique point of S such that 
Z = 2 +tv(xo) with |t] < 46, and let Bs = {y € S : fzo— yl < 5}. 
We estimate the integral of |0,,K| over S \ Bs and over Bg separately. If 
y€ S\ Bs, then 


lz — yl > leo — yl —|z~ 20] > 6 — 36 = 56. 


Hence |8,, N(x, y)| < Ci6'~", so the integral over S \ Bg is bounded by C2 
as above. To estimate the integral over Bs, we note that 


vnlO.,N(x,y)| = EA Mem to) ONS Heo) 
lz ~ zo] + olz0~ yl? 
jz ~ yln 


, 


by Lemma (3.15). Moreover, 
|z — yl? = |x ~ ol? + Jro — yl? + 2(z — 20) - (20 — y), 
and by Lemma (3.15) again, 


]2(x — xo) - (x0 — y)| = 2lz — zollv(zo) - (to — y)| < 2clx — zol|zo — yl’. 
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In particular, since |ro — y| < 6 < 1/(2c), this last quantity is less than 
|z — zol|to — yl, whence 

jz — yl? > $ (Iz — zol” + [z0 — yl’). 
Therefore, 


jz — to| + ¢lzo — yl? 
(\z — zo}? + |zo — yl?)"/? 
C3\z — Zof C3c 


|,N(z,y)| < Cs 


a 
~ (lz — ol? + [20 — yl?)"/? © |zo — yl?-?’ 
so if we set r = |zo — y| and a = |x — zol, integrate in polar coordinates, 


and make the substitution r = as, we obtain 


: a 1 n-2 
fp enweemis ce [ace + ea] eh ar 


0° gs? 2 
<C. — 7 d 6 
s$ «f (i+ 97 s+ C46, 
This last integral converges since the integrand is O(s~”) as s —+ 00, so we 
are done. L] 


(3.21) Lemma. 
Suppose ¢ € C(S) and ¢(zo) = 0 for some zo € S. If u is defined by (3.14) 
and (3.18), then u is continuous at zo. 


Proof: Given e > 0, we wish to produce 6 > 0 so that |u(x)—u(z9)| < 
€ when |x — zo{| < 6. Let C be the constant in Lemma (3.20), and let 
C’ = f, |K(z,y)|do(y) (which is finite since K is a kernel of order n — 2: 
cf. the proof of (3.10).) Choose 1 > 0 so that {¢(y)| < €/3(C + C") when 
ye B,={zES:{z—z9| <n}. Then 


lulz) —ulzoyl sf (10,N(2. 1+ 10,0 (eo, v6) doy) 


+ I 1,,N (2, v) — Or, (20, w)lib(y)| do(y). 
S\B, 


(Of course 6,,N = K when both of its arguments are on S.) The first 
integral on the right is less than 2¢/3. Moreover, if |z — to] < 47 the 
integrand of the third term is bounded and continuous on S\ B, and tends 
uniformly to zero as z —+ Zp. We can therefore choose 6 < dy small enough 
so that the third term is less than €/3 when |z— zo| < 5. and we are done. 
(For future reference we note that 6 depends only on 7 and the uniform 
norm of ¢.) i] 
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We are now ready for the main theorem of this section. If u is defined 
by (3.14), we define the functions u,; on S for small t # 0 by 


u(z) = u(x + ty(z)). 


Thus u; is, in effect, the restriction of u to a surface parallel to S at a 
distance t from S. 


(3.22) Theorem. 

Suppose ¢ € C(S) and u is defined by (3.14). The restriction of u to has 
a continuous extension to Q, and the restriction of u to! has a continuous 
extension to 0. More precisely, the functions u; converge uniformly on S 
to continuous limits u_ and u, ast approaches zero from below and above, 
respectively. u. and uy are given by 


u_(z) = 1é(2) + i K(z, y)6(y) do(y), 


uy (z) = —36(2) + [ K (a, v)6(v) do(y), 


that is, 
u. = 464+T7kd, us = -$¢64Tk¢. 


Proof: If z €S and t < 0 is sufficiently small, then 2 + tv(z) € Q, so 
by Proposition (3.19), 


(2) = 4(2) i 8,,N (2 + tr(2), y)do(y) 
+ [ame + tv(z), y)[6(y) — $(z)] do(y) 


aie [ dN (a + tu(z), v)[d(y) — 4(2)} do(y). 


By Lemma (3.21), the second integral is continuous in ¢ as t -+ 0, so by 
Proposition (3.19) again, 


lim ue(2) = o(2) + [ K(2,n)olu) doy) ~ o(2) f K(e, dou) 
= bo(x)+ [ K(2,wlw) dol). 


If t > 0, the argument is the same except that 


(2) [ 3.,N(2 +tr(2), ») daly) = 0. 
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The uniformity of the convergence follows from the proof of Lemma (3.21): 
we have only to observe that since S is compact, for any « > 0 we can 
choose 7 > 0 so that |¢(x) — 4(y)| < €/3(C + C’) for all z,y € S such that 
jz~y] <n. ' 


(3.23) Corollary. 
o = UL — Uy. 


Theorem (3.22) may be interpreted as follows. If for small t 4 0 we 
define the function K, on S x S by 


K,(z,y) = 0, N(x+tv(z), y), 
then for each z € S, 


i j)al : i § jak : 
im, Kel, )= 352 + K(x, ), spdim_, Kelz, )= zor + K(x, )s 
where 6, is the point mass at x and both sides are interpreted as distribu- 
tions (or measures) on S. 


D. Single Layer Potentials 


We now consider the single layer potential 


(3.24) u(r) = [ N(x, y)(y) do(y) 


with moment ¢, where ¢ € C(S). As with the double layer potentials, it is 
clear that u is harmonic on R"\ S and that |u(z)| = O(|z|?~-") as 2 — 00 
when n > 2, so that u is harmonic at infinity for n > 2. (If n = 2, in general 
we only have |u(z)| = O(log |z|) as x — 00; we shall say more about this 
later.) Moreover, the restriction of N to SxS is clearly a continuous kernel 
of order n — 2, so u is also well defined on S. 


(3.25) Proposition. 
If 6 € C(S) and u is defined by (3.24), then u is continuous on R”. 


Proof: We need only show continuity on S, and the proof is very 
similar to that of Lemma (3.21). Given zg € S and 6 > 0, let us set 
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Bs = {y€S:|zo—y| <5}. Then 


Ju(z)— ule) < fN(e, ydlu)l dav) + | [N(20, 9)6(v)| doy) 
Bs Bs 
+ I sp, N12) ~ Neo, lo) dot). 


Since ¢ is bounded and N(z,y) = O(|z — y|?-") (or O(log |z — y|~*) if 
n = 2), an integration in polar coordinates shows that the first two terms 
on the right are O(5) (or O(6log6~') if n = 2). Given € > 0, then, we 
can make these terms each less than €/3 by choosing 6 small enough. If we 
now require that |x — x9| < 46, the integrand in the third term is bounded 
on § \ Bs and tends uniformly to zero as x — 20, so by choosing |x — zo| 
small enough we can make the third term less than ¢/3 also. i] 


Now let us consider the normal derivative of u. Let V be the tubular 
neighborhood of S given by Proposition (0.2). Recall that we have defined 
the normal derivative 0, on V by formula (0.3); thus for c € V\ S we have 


(3.26) ayu(2) = i 8,,N(2, y)6(y) do(y). 


This looks just like a double layer potential except that 0, is applied to N 
with respect to x instead of y. In fact, since N(z,y) = N(y, 2), Ov. N(z,y) 
is just 0,,N evaluated at (y,z). In particular, if we set 


K*(z,y) = K(y,2), 


the right hand side of (3 26) makes sense for z € S if we interpret it as 
(3.27) [ 8.6) dou) = Tx (2). 
Ss 


Since K is a continuous kernel of order n — 2, so is K*; thus (3.27) defines a 
continuous function on S by Proposition (3.12). Moreover, since K is real- 
valued, it is easily checked that Tx- is the adjoint of Tx as an operator on 
L?(S); hence (3.27) is just Ty ¢(z). 

As might be expected, there is a jump discontinuity between the quan- 
tities defined by (3.26) on V \ S and by (3.27) on S. Indeed, we have the 
following theorem. 
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(3.28) Theorem. 
Suppose ¢ € C(S) and u is defined on R" by (3.24). Then the restriction 
of u to D (resp. 2) is in C,(Q) (resp. C,(O')), and for x € S we have 


8,.u(2) = ~16(2) + iy, K(u,2)6(u) do(y); 
A4u(z) = 44(z) + i K(y,2)6(y) do(y), 


that is, 
O-u = —-4364+TK4, Ozu = 164+TK¢. 


Proof: We already know that u is everywhere continuous. Consider 
the double layer potential 


(2) = ff &,N(2, nly) doy) 
on R"\ S, and define the function f on the tubular neighborhood V of S 
by 
f(e) = v(z) + 8, u(z) (eéeV\S), 
Tx o(z)+Tz 2) (2 € 5). 


We claim that f is continuous on V. The restrictions of f to V\ S and 
S are continuous, so it suffices to show that if ro € S and z = 29 + ty(z9) 
then f(x) — f(t0) + 0 ast — 0, the convergence being uniform in zo. But 


f(z) — f(zo) 


Z I [a,.N (2, y) + &,N(z,¥) ~ 8.20, y) — Or, (20, »)] 6(y) do(y). 


We proceed as in the proof of Lemma (3.21): write this expression as an 
integral over Bs = {y € S: |zo — y| < 5} plus an integral over S\ Bs. As 
before, the integral over S \ Bs tends uniformly to zero as x —> ro. On the 
other hand, the integral over Bs is bounded by 


{Iblleo I |a.,. N(x, y) + ,.N(2, »)| do(y) 


plus the same expression with z replaced by zo. Thus it suffices to show 
that for all x on the normal line through zo, 


i: [a..N(2,y) + &,N(2, »)| doy) 
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can be made arbitrarily small by taking 6 sufficiently small, independent 
of z and zg. Now 


a Nagas Ca) a ieee vy) 


Wylz — yl” ; wylz — y|” , 


so 


a), N(2,y) + &,N(2,y) = Ea Yee) = vy) 


But |v(zo) —v(y)| = O(|z0— yl) since v is C1, and |x —y| > Clxo —y| since 
z is on the normal through zp. Hence 


|8,.N(z,y) + &,N(z,y)| < Clo — y!?", 


and the integral is dominated by 
5 
i) r2-M ph 2 dp = 6, 
0 


Thus f = v + 0,u extends continuously across S. Therefore, by Theo- 
rem (3.22), for all c € S we have 


Tx $(2) + Te o(2) = v_(2) + O-u(x) = 5 ¢(2) + Tr o(2) + ,-u(z), 


so that 
a,-u(z) = —34(z) + Ti o(2); 


and also 
Tx o(z) + T(z) = 04 (2) + O4u(z) = —34(z) + Tk d(z) + Ov4u(z), 


so that 
A4u(2) = 34(2) + Tk d(2). 


The convergence of 0,u(xz + tv(x)) to 0,4u(xz) is uniform in z since the 
same is true of v and v + O,u. | 


(3.29) Corollary. 
¢ = O,4u— Ou. 


We conclude the discussion of single layer potentials with three lemmas 
that will be needed in the next section. 
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(3.30) Lemma. 
If $€C(S) and 4¢4+7y¢= f, then foo= fy f. 


Proof: By Proposition (3.19) and Fubini’s theorem, 
ip f(x) do(2) = 4 i $(2) do(x) + [ yf K(y,2)4(y) do(y) do(z) 
= i $(2) do(z) + 3 I 4(y) do(y) = fh g(x) do(z). 4 


(3.31) Lemma. 

Suppose n = 2. If ¢ E€ C(S), the single layer potential u with moment ¢ 
is harmonic at infinity if and only if fy ¢ = 0, in which case u vanishes at 
infinity. 


Proof: We have 
uz) = 5 flog le — vl ~ tog )(v) day) + EEE f(y) doy). 


Since log |z—y|—log |z| — 0 uniformly for y € S as x — oo, the first integral 
vanishes as x — oo. In view of Proposition (2.74), the result follows. a 


(3.32) Lemma. 
Suppose n = 2. If ¢ € C(S), fs 4 = 0, and the single layer potential u 
with moment ¢ is constant on 92, then ¢ = 0 (and hence u = 0). 


Proof: By Lemma (3 31), u is harmonic at infinity, so if u= c on 9, 
u solves the exterior Dirichlet problem with f = c. But the solution to this 
problem (unique, by Proposition (3.2)) is u = c. Thus u = c everywhere, 
so ¢ = 0 by Corollary (3.29) (or because 0 = Au = ¢dc). f 


EXERCISES 


1. Let Q = Br(0), S = Sr(0). Show that the single layer potential u(r) 
with moment ¢ = 1 is given by 

2—n 

Imex(leL RN" (n> 2), loglmax(lzl, R)] (n = 2). 

That is, the potential generated by a uniform distribution of charge 

on a spherical shell is constant inside the shell, and outside the shell it 

behaves as if the charge were all concentrated at the origin. (Hint: Show 

that u is radial and that the asserted formula for u is asymptotically 
correct as x — 00, and apply Corollary (2.3).) 
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2. Use the result of Exercise 1 to show that if f € L1(IR") is radial and 
u = f * N is the associated potential, then 


u(x) = N(x d dy. 
(2) = N(2) tas Hu) dy + Loe f(y) dy 


E. Solution of the Problems 


We can now apply the Fredholm theory to solve the Dirichlet and Neumann 
problems. For f € C(S), consider the integral equations 


(3.33) 64+ Tx = f, -364+Tko=f, 
564+ TKO =f, -36+TKO=f, 
where K is defined by (3.16). By Proposition (3.13) (with K and f replaced 
by 42K and +2f), the solutions ¢ will be continuous on S, if they exist. 
Therefore, by Theorems (3.22) and (3.28), the double layer potential u 
with moment ¢ will solve the interior (resp. exterior) Dirichlet problem if ¢ 
satisfies 4¢+T¢ = f (resp. —46+7K¢ = f), and the single layer potential 
u with moment ¢ will solve the interior (resp. exterior) Neumann problem if 
o satisfes —1¢+77¢ = f (resp. 36+ ¢ = f). There is one exception: if 
n = 2, in general the single layer potential will grow like log|z| at infinity 
and so will not be harmonic at infinity. However, for n = 2 there is an 
extra necessary condition for the solution of the exterior Neumann problem 
given by Proposition (3.6), and this condition is equivalent to harmonicity 
at infinity by Lemmas (3.30) and (3.31). Thus if the integral equations 
are solvable and the necessary conditions are satisfied, the boundary value 
problems are solvable. (We shall soon see that the Dirichlet problems are 
solvable in any case.) 

We proceed to study the solvability of the equations (3.33). By Corol- 
lary (0.42), this means identifying the eigenspaces 


V4 = {6:TKd = 34}, V_={¢:Tk¢=—}4}, 
We ={6:TRo=46}, W_={6:TKo= 4}. 


Here ¢ is allowed to range over either L?(S) or C(S); the result is the same, 
by Proposition (3.13). 
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Recall that 2 has components Q),...,Qm, and that 0’ has components 
6,--., Qf, where 2 is unbounded. We define functions a4,...,@m and 
a{,...,a4,, on S by 

_ jl ifze6a;, 

ee) = e otherwise, 
. ‘ 

ai (z) = 1 ifz €d0;, 


0 otherwise. 


(3.34) Proposition. 
aj € V4 forj=1,...,m, and aj € V_ forj=1,...,m’, 


Proof: Since 


Teaj= [ K(2v)dolw), Tray = fl K(e,y) dow), 
B 


3 


the result follows by applying Proposition (3.19) with Q replaced by Q; or 
0;. The sign is reversed for a; because the normal v points into Q'. 1 


Clearly aj,...,@m are linearly independent, so dimVy > m, and by 
Theorem (0.38b), dim W, = dimV,. On the other hand, suppose 8 € W,, 
and let w be the single layer potential with moment f. Since 0,_w = 0 by 
Theorem (3.28), w is constant on each 2; by Proposition (3.3), so we can 
define a linear map from Wy to C” by 


(3.35) B — (w]Qy,..., w]Qn). 


If n > 2, this map is injective. Indeed, if wjQ = 0, then w solves the 
exterior Dirichlet problem with f = 0, so w|Q! = 0 by Proposition (3.2). 
Hence w = 0 everywhere, so 8 = 0 by Corollary (3.29). It follows that 
dimW, < m, so dimW, = m and the map (3.35) is an isomorphism. 

If n = 2, the preceding argument breaks down because w need not 
be harmonic at infinity. (A counterexample to the injectivity of (3.35) is 
provided by Exercise 1 of §3D, with R = 1. In that situation, V, = Wy, 
because K(x, y) = K(y,z) by explicit calculation; the function ¢ = 1 on 
S belongs to Vi = W4; and the associated potential u vanishes on 2.) 


However, if we set 
wt = {pew,  [a=oh, 
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in view of Lemma (3.31) the preceding argument shows that the restriction 
of the map (3.35) to W§ is injective, and by Lemma (3.32) its range does 
not contain the vector (1,1,...,1). Hence dim W§. < m—1. But dim W4. > 
dim W,, —1 since wi is defined by the vanishing of one linear functional, so 
again it follows that dimW, = m. In short, we have proved the following 
result. 


(3.36) Proposition. 

The spaces V4. and W4 have dimension m. Moreover: 

a. Ifn > 2, for each (ai,...,am) € C” there is a unique B € Wy such 
that the single layer potential w with moment f satisfies w\Q; = aj for 
j=il,....m. 

b. Ifn = 2, there is an (m — 1)-dimensional subspace X of C™ such that: 

i C™ =X @C(i1,1,...,1); 

ii. for each (a,,...,4m) € X there is a unique B E we such that 
the single layer potential w with moment f satisfies w|Q; = a; for 
j=l,...,m. 


A similar argument applies to V_ and W_. Indeed, we have dimV_ = 
dim W. > m' by Theorem (0.38b) and Proposition (3.34). If 8 € W_ and 
w is the associated single layer potential, then w is constant on each 9}, 
and w = 0 on 4 since w vanishes at infinity. (This is true even when 
n = 2. Indeed, every 8 € W- satisfies [, @ = 0 by Lemma (3.30), so its 
single layer potential vanishes at infinity by Lemma (3.31).) If w = 0 on 
then w = 0 on 2 by the uniqueness for the interior Dirichlet problem, 
so # = 0 by Corollary (3.29). Thus the map 


B= (wlQh,..., w|Qn!) 
from W. to C”’ is injective and hence an isomorphism. Therefore: 


(3.37) Proposition. 

The spaces V_ and W. have dimension m’. For each (a,,..., 4m‘) € cm 
there is a unique 8 € WL _ such that the single layer potential w with 
moment § satisfies w|Q, = a; for j =1,...,m! and w|Q% =0. 


We need one more technical result. 
(3.38) Proposition. 


L(S) = Vi @ Wy = Vi ®W_. (The direct sums are not necessarily 
orthogonal.) 
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Proof: By Proposition (3.35), V+ is a closed subspace of codimension 
m and dim Wy, = m, so for the first equality it is enough to show that 
Vi NW, = {0}. Suppose ¢ € Vi NW4; then TZ ¢ = $¢, and by Corollary 
(0.42), 6 = —49+ Tyy for some p € L?(S). By Proposition (3.13), 6 and 
are continuous. Let u and v be the single layer potentials with moments 


¢ and y. Then by Theorem (3.28), 
Oo-u=0, Ov = $= 464+TKb = 4. 


Multiplying the first equation by v and the second by u, subtracting, and 
integrating over S, we obtain 


[co -v—vd,_u) = [vasa 
Ss s 


By Green’s identities, the left hand side equals 


[ude —vAu) =0, 
2 


while the right hand side equals 


-f (uAu + |Vul?) =--f IWul?. 
nN! feu 


(The application of Green’s identity on 2’ needs some justification, which 
we shall give below.) Therefore u is locally constant on 2’, so ¢ = O,4u = 0. 

The proof that L?(S) = VL ® W.. is much the same: again it suffices 
to show that if 6 € VN W~_ then ¢ = 0. But for such a ¢ we have 
Tx = —$¢ and ¢ = $¥ + Tx for some 9, so if we let u and v be the 
single layer potentials with moments ¢ and y, it follows that 0,4u = 0 and 
Ov4v = @ = 0,u. Hence 


[dee — v0pyu) = I udp. 
Ss Ss 


By Green’s idenities, 
O=-— | (uAv—vAu) = i [Vul’, 
2 a 


so u is locally constant on © and thus ¢ = 0,-u= 0. 

To justify these uses of Green’s identities on the unbounded region 1’ 
we replace / by 2'N B,(0) and let r — oo as in the proof of Proposition 
(3.4). To make this work it is enough to know that u is harmonic at infinity, 
so that u and its radial derivative satisfy the estimates of Propositions 
(2.74) and (2.75). Harmonicity at infinity is automatic when n > 2, and 
for n = 2 it is equivalent to the condition [5g = 0 by Lemma (3.31). 
The latter condition is valid when ¢ € vt by Proposition (3.34) since 
Ss%= Dy (él a;), and it is valid when ¢ € W_ by Lemma (3.30). | 
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(3.39) Corollary. 
L?(S) = Range(—$1 + Tx) ® V4 = Range($I +7) @V-. 


Proof: Since Range(—}J + 7x) = Wy and Range($J + Tx) = W+ 
by Corollary (0.42), as in the proof of Proposition (3.38) it suffices to show 
that Wi NV, = WLiNV_ = {0}. Suppose ¢ €E Wi NV. By Proposition 
(3.38) we can write ¢ = ¢, + d2 where ¢; € Wy and do € Vi: But 
(¢|¢1) = 0 since ¢ € Wt and (4| 42) = 0 since ¢ E V4; hence (¢|¢) = 0, 
so ¢ = 0. Likewise WL NV_ = {0}. 1 


Finally, we come to the theorem for which this whole chapter has been 
preparing. 


(3.40) Theorem. 
With the notation and terminology of §3A, we have: 

a, The interior Dirichlet problem has a unique solution for every f € C(S). 

b. The exterior Dirichlet problem has a unique solution for every f € 
C(S). 

c. The interior Neumann problem for f € C(S) has a solution if and only 
if fon, f =0 forj =1,...,m. The solution is unique modulo functions 
which are constant on each Q;. 

d. The exterior Neumann problem for f € C(S) has a solution if and only 
if fag f = 0 for j = 1,...,m! and also for j = 0 in case n = 2. The 
solution is unique modulo functions which are constant on 04,...,Q1,, 
and also on 6 in case n = 2. 


Proof: We have already proved the uniqueness and the necessity of 
the conditions on f in Propositions (3.1-6), so all that remains is existence. 

For (c) we simply observe that Son, f = (f|a;), so these integrals 
vanish if and only if f € V+. By Corollary (0.42), this is necessary and 
sufficient to solve the integral equation -3¢ +Ti¢= f. If ¢ is a solution, 
then ¢ is continuous by Proposition (3.13), so by Theorem (3.28) the single 
layer potential with moment ¢ solves the interior Neumann problem. 

Similarly, for (d) we have Sony £ = (f|a}) for j = 1,...,m’, so these 
integrals vanish if and only if f € V4, in which case we can solve the 
equation 1¢ +7%¢ = f and then solve the Neumann problem with the 
single layer potential with moment ¢. In case n = 2, by Lemmas (3.30) 
and (3.31) this potential is harmonic at infinity if and only if Sosy f=0, 


since Toa f is already assumed to vanish for j > 1. 
+ 
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Now consider (a). By Corollary (3.39) and Propositions (3.34) and 
(3.37) we can write 


, 


f=4¢4+Tkot > ajo, (a; EO), 
1 


where ¢ is continuous since f — }> aja‘ is (Proposition (3.13)). By Theo- 
rem (3.22), the double layer potential v with moment ¢ solves the interior 
Dirichlet problem with f replaced by io +T7K¢. Moroever, by Propo- 
sition (3.37) there exists @ € W_ such that the single layer potential w 
with moment # satisfies w{Q; = a; for j > 1 and wiQ) = 0. But then 
w|S = Bad aja‘ since w is continuous on S (Proposition (3.25)), so the 
solution of the original Dirichlet problem is u = v+w. 

When n > 2, the proof of (b) is exactly the same, with the roles of 2 
and 2 interchanged and Proposition (3.36) replacing Proposition (3 37). 
For the case n = 2, the argument needs to be modified as follows. As 
above, we can write 


as —i¢ +TK¢o+ Yo aja; (¢€ C(S), a; EC). 
1 


The double layer potential v with moment ¢ solves the exterior Dirichlet 
problem with f replaced by ~$¢+ 7K. Moreover, since )> a; = 1 on S, 
with notation as in Proposition (3.36) we can write 


m m 
Yo ajay = Do bjay +e ((b1,---.bm) EX, cE ©), 
1 1 


and there exists @ € W{ such that the single layer potential with moment 
B satisfies w|Q; = bj. wis also harmonic at infinity by Lemma (3.31), so w 
solves the exterior Dirichlet problem with f replaced by }>b;a;. Finally, 
the constant function c solves the exterior Dirichlet problem with f replaced 
by c, so the solution to the original Dirichlet problem is v + w+ c. a 


F. Further Remarks 


The classic treatise on potential theory, which has retained its value after 
more than a half-century in print, is Kellogg [31]. The reader may consult 
this work for more information on the behavior of single and double layer 
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potentials, as well as various other aspects of the subject not discussed 
here. Our treatment also owes much to the lucid exposition in Mikhlin 
(36]. 

The results of this chapter extend, with no essential change, to the 
somewhat more general case where S is assumed only to be of class C!+* 
for some a > 0; see Mikhlin [36]. Some of the arguments involving tubular 
neighborhoods need some technical elaboration, but the main difference is 
that K is a kernel of order n—1—a rather than of order n—2. (The reason is 
that the conclusion of Lemma (3.15) becomes |(z—y)-v(y)| < elz — y|!+.) 
In the limiting case a = 0, where S is assumed to be only C! or Lipschitz, 
the kernel K is of a sufficiently singular nature that the theory of §2B no 
longer applies. Indeed, only recently has the method of layer potentials 
been extended to these cases — by Fabes, Jodeit, and Riviére [13] when 
S is C), and by Verchota [54] when S is Lipschitz. These authors obtain 
sharp theorems for the Dirichlet and Neumann problems with boundary 
data in L?; their work relies on some deep results of Calderén, Coifman, 
Macintosh, and Meyer on singular integrals. See also Jerison and Kenig 
{28}. 

There are other methods for solving the Dirichlet problem with contin- 
uous boundary data that yield results under minimal regularity hypotheses 
on S, of which the most popular is Perron’s method of subharmonic func- 
tions. An exposition of this theory can be found in John [30], Treves [52], 
and Jerison and Kenig [28]; see also Kellogg [31] for some related material. 

The techniques of integral equations can also be applied to solve other 
boundary value problems for the Laplacian. (Here and in what follows we 
shall assume that S is at least of class C?.) For example, if b € C%(S) and 
f € C%(S) for some a > 0, consider the problem 


Au=0onQ, 0,.u+ bu = f on S, 


which arises in the theory of stationary heat flow. Provided b > 0 on S, a 
unique solution to this problem exists in the form ofa single layer potential 
with moment ¢, where ¢ satisfies 


¢4+TKO=f,  K'(x,y) = K(z,y)+)(z)N(z,y). 


Even if b is not positive, this equation can be solved provided f satisfies a 
finite number of compatibility conditions. See Kellogg (31, §X1.12]. 
More generally, one can consider the oblique derivative problem 


Au=0o0nQ, O,u+ bu = f on S. 
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Here 0,u = Vu-y where p is a smooth vector field on a neighborhood 
of S with -v > 0 on S, and b, f € C%(S) for some a > 0. Again, one 
attempts to find u in the form of a single layer potential, but the kernel 
On. N(z,y) which arises is no longer of order n -2. Rather, it satisfies only 
|Ou.N(x,y)| < Cle —y|'~”, so it is not integrable as a function of either x 
or y on S. However, it has certain cancellation properties which guarantee 
that the integral 


[ 8y.N (zx, y)b(y) do(y) 


is well defined for ¢ € C%(S) (a > 0) if it is interpreted in a suit- 
able principal-value sense. (The idea is much the same as in Theorem 
(2.29).) The “singular integral operator” defined in this way is not com- 
pact. Nonetheless, one can show that there is another singular integral 
operator which inverts it modulo a compact operator, so the problem is 
again reduced to Fredholm theory. 

These results were obtained by G. Giraud in a long series of papers 
in the 1930’s, in which he dealt with the Dirichlet, Neumann, and oblique 
derivative problems for general second-order elliptic operators with vari- 
able coefficients. For a description of Giraud’s work and references to 
the literature, see Miranda [37]. (This remarkable book contains an enor- 
mous amount of information about elliptic equations, often with sketchy 
or nonexistent proofs, and it concludes with a seventy-page bibliography.) 

More recently, the singular integral operators used by Giraud were stud- 
ied systematically by Calderén and Zygmund and then incorporated into 
the theory of pseudodifferential operators. The theory of layer potentials 
has found a new incarnation in a powerful method, due to Calderén and 
others, for reducing boundary value problems for quite general differential 
operators on smoothly bounded domains to the study of pseudodifferential 
equations on the boundary. (The constructions in this chapter and in the 
work of Giraud are special cases of this method, although they also yield 
detailed information that is not available in in the general setting.) See 
Hormander [27, vol. III] and Taylor [48]. 

One may expect that if we impose additional smoothness conditions 
on the boundary data, the solution will have corresponding smoothness 
properties near the boundary. This is indeed the case, and it can be proved 
by the techniques of this chapter. In Chapter 7 we shall obtain some 
results along these lines, by different methods, for the Dirichlet, Neumann, 
and oblique derivative problems for a general class of second-order elliptic 
operators. 


Chapter 4 
THE HEAT OPERATOR 


We now turn our attention to the heat operator 
n 
a -A=8- 505} 
1 


on R" x R with coordinates (z,t). It has the following physical interpre- 
tation: the temperature u(x,t) at position z and time t in a homogeneous 
isotropic medium with unit coefficient of thermal diffusivity satisfies the 
heat equation 0,u—Au = 0. (See Folland (17, Appx. 1] for a brief deriva- 
tion.) The same equation also governs other diffusion processes, such as 
the mixing of two fluids by Brownian motion. 

A few caveats about the use of the heat equation in physics: First, the 
heat equation says nothing about the microscopic physical processes that 
actually produce heat flow. It describes a limiting situation in which the 
size of atoms can be considered as infinitesimal, or — statistically speaking 
— in which it is legitimate to pass to the limit in the central limit theorem. 
It does not recognize the existence of absolute zero, since if u is a solution 
then so is u+c for any constant c. And, of course, it takes no account of 
convection effects in fluids. Nonetheless, the heat equation is very useful 
in many physical situations, and it is of great mathematical importance. 

The heat operator is the prototype of the class of parabolic opera- 
tors. These are the operators of the form 0; + Djal<2m a,(z,t)02 where 
the sum satisfies the strong ellipticity condition discussed in §7A, namely, 
(—1)™ Re Djajz2m Ga(z, t)€* > cl€/?" for some c > 0. For information 
about general parabolic operators, we refer the reader to Friedman [20], 
[21], Treves [52], and Protter and Weinberger [40]. 
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A. The Gaussian Kernel 


We begin by considering the initial value problem 
(4.1) Gu — Au=0 on R® x (0,00), u(z,0) = f(z). 


Physically, this is a reasonable problem: given the temperature at time 
t = 0, find the temperature at subsequent times. It is also reasonable 
mathematically, since the heat equation is first order in ¢. (The Cauchy 
problem for the hyperplane ¢ = 0 is certainly overdetermined, since this 
hyperplane is everywhere characteristic.) 

We can quickly obtain a solution by taking the Fourier transform of 
(4.1) with respect to the z variables. Indeed, assuming for the moment 
that f is in the Schwartz class 8, and denoting the Fourier transform of 
u(z,t) with respect to z by u(é,t), we have 


GE, t) + 407 |E[7G(E,t)= 0,  —G(E,0) = F(E). 


This is an initial value problem for a simple ordinary differential equation 
in t, and the solution is 


a(E,t) = Flee *""E"* (> 0). 


Thus u(x,t) = f * K:(x), where K,(€) = e747"l’*. By Theorem (0.25), 
this means that 


(4.2) Ki(z) = K(2,t) = (4nt)-/e F774" (t > 0). 


The function K defined on R" x (0,00) by (4.2) is called the Gaussian 
kernel (or Gauss- Weierstrass kernel or heat kernel). We note that 


K, (2) = t7"/2 Ky (t- 4/22), [ka dr = K,(0) =. 


Thus by Theorem (0.13), {Kz}:>0 is an approximation to the identity. 
(Make the substitution « = t!/? to obtain the usual formulation.) We 
therefore have: 


(4.3) Theorem. 

Suppose f € L?(R"), 1 < p < oo. Then u(z,t) = f * K,(x) satisfies 
Ou — Au = 0 on R” x (0,00). If f is bounded and continuous, then u is 
continuous on R® x [0,00) and u(z,0) = f(z). If f € L? where p < co, 
then u(.,t) converges to f in the L? norm ast — 0. 
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Actually, since K,(x) decays very rapidly as x —+ oo, f x K, makes sense 
for i < T provided only that |f(z)| < Cel#!’/4T. Under this condition, one 
easily verifies that f * K, satisfies the heat equation (by differentiating 
under the integral) and that it approaches f uniformly on bounded sets as 
t — 0 provided f is continuous. 

Moreover, since all derivatives of K(z,t) decrease rapidly as r — 00, 
we can differentiate under the integral as often as we please and conclude 
that u is C™. Thus the heat equation takes arbitrary initial data and 
immediately smooths them out. In particular, we cannot expect to obtain 
a solution for ¢ < 0 unless the initial data are already very smooth: for 
the heat equation, time is irreversible. (This is related to the second law 
of thermodynamics.) 

How about uniqueness for (4.1)? As for the Dirichlet problem for A in 
a half-space (§2G), we must impose some conditions at infinity, but here 
the conditions are much weaker. 


(4.4) Theorem. 

Suppose u is continuous on R" x [0,00) and C? on R® x (0,00), and that 
u satisfies 0;u — Au = 0 (t > 0) and u(z,0) = 0. If for every € > 0 there 
exists C' > 0 such that 


lu(z,t)|< Cel, |Veu(z,t) < Cel’, 
then u is identically zero. 


Proof: We first note that if f and g are C? functions on a domain in 
R" xR, 


HOS — AS) + f(g + Ag) = D> Oj (F0j9 — 99; f) + A(f9) = Vee F 
1 


where 


F= (fig —9f,..-,fOn9 —92nf, £9). 


Given zo € R" and to > 0, let us take f(z,t) = u(z,t) and g(z,t) = 
K(x—20, to—#). Then 0,f -Af = 0 fort > Oand jg+Ag = 0 fort < to. 
We apply the divergence theorem on the region 


Q= {(z,t):|z]}<r,a<t<b} (O<a<b<to), 


The Heat Operator 145 
obtaining (by virtue of (0.1)) 


0= Fev 
en 


= u(z,6)K(x — zo, to b)dz~ [ u(z,a)K(z — 20, to — a) dz 
lelsr lz}<r 


n b 
+ Dy J. _,uesnaxee.9 - K(2,t)d;u(z, 1) do(x) dt. 


Now let r — oo. The last sum vanishes by our assumptions on u, and since 
K is even in x we obtain 


0 = [u(-, 6) * Kt .-2](z0) — [u(-, a) * Kto-a](z0). 


As b — to, the first term on the right approaches u(xo,éo), and as a — 0, 
the second term approaches u(-,0) * 1,(zo0) = 0. Hence u(zo,to) = 0 for 
all zo € IR" and ty > 0. 1 


That uniqueness does not hold for the initial value problem (4.1) with- 
out some conditions at infinity can be seen as follows. For simplicity we 
take n = 1. If g(t) is any C® function on R, then formally (disregarding 
questions of convergence) the series 


"(t)r 2k 


( 
(z uan= (2k)i 


satisfies the heat equation. To produce a nonzero solution of (4.1) with 
f = 0, then, it suffices to produce a nonzero g satisfying g)(0) = 0 for 
all & such that this series converges nicely on R” x [0,00). The function 
g(t) = exp(—t~?), for example, will do the job; see John [30] for more 
details. 

There is another theorem due to Widder, however, to the effect that if 
f > 0, then u(z,t) = f* K,(x) is the only nonnegative solution of (4.1); see 
John [30]. This gives a satisfying uniqueness theorem for physical problems 
since temperatures on the Kelvin scale are always nonnegative. 

The Gaussian kernel can also be used to solve the inhomogeneous heat 
equation 0,u— Au = f. Let us extend K to R" x R by setting 


(4.5) K(z,t) = {ee er lela oe : ° 


We note that K is locally integrable on R" x R — in fact, integrable 
on any region whose projection on the t-axis is bounded above, since 
[\K(z,t)|dz = 1 fort > 0. 
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(4.6) Theorem. 


The kernel K defined by (4.5) is a fundamental solution for the heat oper- 
ator. 


Proof: Given ¢ > 0, set K.(z,t) = K(z,t) ift > € and K,.(z,t) =0 
otherwise. Clearly K, — K in the topology of distributions as « — 0 (by the 
dominated convergence theorem), so we must show that 0K, -AK, — 6, 
that is, for any ¢ € C&, 

(K., —O¢ — Ad) — (0,0) as e > 0. 


An elementary integration by parts shows that 


Ke -teo hee a i. K(2,t){(—0 — A)o(z, t)] dz dt 
= ia [fe — A)K(z,1)]6(2,t) dz dt+ a K(z, ola, ede 
= 0+ he K(—2,€)6(2,€) az, 
since K is even in x. But this is just the convolution 


[Ke * $(-, €)](0) = [Ke * 6(-, 0)}(0) + [Ke * [4(-, €) ~ 6(-, 0)]] (0). 


The first term on the right tends to ¢(0,0) as € + 0 by Theorem (4.3), and 
the second is bounded by 


sup [d(z, €) — oz, 0) [Kelli = ap \d(z, €) — o(z, ODI, 
which tends to zero as € > 0. I 


(4.7) Corollary. 
The heat operator is hypoelliptic. 


Proof: K is C@ except at (0,0), since K(z,t) vanishes to infinite 
order as ¢ decreases to zero when z # 0. Hence Theorem (1.58) applies. § 


Remark: The analytic version of this theorem is false, and the fun- 
damental solution K, which satisfies 0,K — AK = 0 away from the origin 
but is not analytic on the hyperplane ¢ = 0, is a counterexample. 
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We can now solve the inhomogeneous heat equation 0,u — Au = f for 
any compactly supported distribution f by taking u = f * K (convolution 
on R" x R). The hypothesis of compact support can be relaxed; here is a 
representative result. 


(4.8) Theorem. 
If f € L1(R" x R), then u= f * K is well defined almost everywhere and 
is a distribution solution of Qu — Au = f. 


Proof: We have 
t t 
u(z,t)= | [sw )K(z—y,t—s)dyds = [ f(s) * Ki_s(x) ds, 


the convolution here being on R®. Since f(-,s) € L'(IR”) for almost every 
s and ||Ky~sl|1 = 1, by Young’s inequality we have 


t 00 
cup ff UfG.s)*Keo(aldedss ff Iftys)ldvas = lish. 


It follows that the integral defining u(x,t) converges absolutely for almost 
every (z,¢) and that u is locally integrable. The proof that 0:u— Au = f 
now proceeds just like the proof of Theorem (2.21); details are left to the 
reader. i] 


Remark: An argument similar to the proof of Theorem (2.28) shows 
that if f € C*+* for some k > 0 and 0 < a < 1, then u is C*+?+? in x and 
Cktl+ in ¢ and is a classical solution of Qu —Au= f. 


One of the most amusing applications of the Gaussian kernel is Weier- 
strass’s original proof of his celebrated approximation theorem. To obtain 
the full strength of this theorem, we shall assume the following technical 
result: If S is a compact set in R” and f is a continuous function on S, 
then f can be extended to be continuous with compact support on R®. In 
general, this follows from the Tietze extension theorem, but if S is reason- 
ably nice one can usually construct a solution explicitly. (For example, if 
S is a C? hypersurface, extend f to the tubular neighborhood V of S given 
by (0.2) by setting f(a +tv(x)) = o(t) f(z) where ¢ € C(R), 6(0) = 1, and 
¢(t) =0 for |t| > $e, and then set f = 0 outside V. A similar construction 
works if S is the closure of a domain with C? boundary.) 
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(4.9) The Weierstrass Approximation Theorem. 
If S is a compact subset of R", the restrictions of polynomials to S are 
dense in C(S) in the uniform norm. 


Proof: Given f € C(S), extend f to be continuous with compact 
support on R". Given € > 0, by Theorem (4.3) we can find t > 0 so that 


sup |f « Ki(z) — f(2)| < de. 
zek" 


f* Ki(2) = (Ant)-9/? | : Ply)en#-¥P/# ay, 
supp 


Since the power series for e” converges uniformly on compact sets, we 
—lz—y|?/4t i i i 
can replace e by a suitable partial sum with an error less than 


(A4nt)"/¢/2I| fll, for « € S and y € supp f. Thus, 


sup |f * Ki(z) — 9(2)| < de, 
r€S 


where 
K n k 
—n -1 Tj — Yj ‘ 
ate) = (Anan? [pty GP Rae ay, 
supp f 0 ‘ 
But g is clearly a polynomial of degree 2K. J 
EXERCISES 


1. Complete the proof of Theorem (4.8). 


2. Suppose ui(y,t),...,un(y,t) are solutions of the one-dimensional heat 
equation Gu = Ofu. Show that v(z,¢) = []} u;(z;,t) satisfies the n- 
dimensional heat equation. (What is special about the heat equation 
that makes this work? E.g., there is no analogous result for solutions 
of Laplace’s equation.) 


3. A formal use of the Fourier transform on R" x R suggests that one 
should obtain a fundamental solution for 0, — A by taking the inverse 
Fourier transform of G(£,7) = (2air + 4m7|€|?)71. Prove that this 
works, as follows. 

a. Show that G is locally integrable on R" x R. 
b. Show that G = K where K is defined by (4.5). (Hint: Take the 
Fourier transform of K first in z, then in t. Why is this legitimate?) 
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4. Solve the one-dimensional heat equation 0;u = 02u on the region z > 0, 
t > 0 subject to the initial condition u(x,0) = f(x) (x > 0) and either 
the boundary condition (a) u(0,t) = 0 or (b) d,u(0,t) = 0. (Hint: 
consider the odd or even extension of f to R.) 


B. Functions of the Laplacian 


The Gaussian kernel has many other interesting applications in analysis 
and probability. We shall limit ourselves to one of them, namely, the com- 
putation of convolution kernels for functions of the Laplacian. 

We begin by observing that (-AfY(é) = 477|€[?f(é) for f € S(IR”). It 
follows that if P is a polynomial in one variable, 


[P(—-A)sT(E) = P(477 (E17) F(E), 


and this suggests the following general construction of functions of —A. 
Suppose y is a function on (0, 00) such that € — (47? |€[?) is a tempered 
distribution on R". Then we can define an operator y(—A) : 8 > 8’ by 


[(—-A) FTE) = V(4r IEP) FQ), ie, Y(-A)F = [¥(42?| - PANY. 


(—A) can also be expressed as a convolution operator: y(—A)f = 
f*ky where ky is the inverse Fourier transform of the tempered distribution 
€ — (407 |é|?). For example, if ¥(s) = e~** with t > 0, then «y is just 
the Gaussian kernel Ky by Theorem (0.25); thus 


fehead*f, 


which is just what one would expect by formally solving the heat equation 
Qu = Au with initial data u(-,0) = f as an ordinary differential equation 
in t. (This is essentially what we did in deriving Theorem (4.3).) 

With this information in hand, we have a method for computing the 
convolution kernel xy whenever ~ can be expressed nicely in terms of ex- 
ponential functions. More specifically, suppose 


(4.10) v(s) = i $(r)e~ 2007) dr 


for some functions ¢ and w on (0,00) with w > 0. Then, formally, 


wd) = [a(n et ar, 
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so the kernel ky should be given by 


ky(x) = fe (7) K (2,w(r)) dr. 


So far this is just a heuristic procedure which needs some justification. 
The interested reader may wish to formulate a theorem along these lines 
that encompasses a general class of ¢’s and w’s, but here we shall content 
ourselves with working out some important specific examples. 

Our first example is the negative fractional powers of —A, ¥(s) = s~8 
where 0 < ReG < gn. (The restriction ReB < in is necessary to make 
(4x7 |€{?) integrable near the origin.) We have 


ep.) [” 0-1 
sh = re * eS" dr, 
rar 


so the convolution kernel of (—A)~® should be 
1 ca p-1 
—_ Tt?" K (2,7) dr. 
rah oR 


We can compute this integral by making the substitution o = |z{?/4r: 


Lf page = aan | p-1-(n/2),-1s|?/47 
ra T K(2,7) dt = Soy aayala B T € dr 


1 co 
= (n/2)—f-1,-¢0 
“Towers | ° vee 
T(in- 8) 


~ T(B)40 17? [jn 26 


It is not hard to see that this function is indeed the inverse Fourier trans- 
form of (477|é|?)~*; see Exercise 1. (It is not quite trivial either, since 
neither of these functions is globally integrable, so one must interpret the 
Fourier transform in the sense of distributions.) 

It is customary to set B= ta and to define 


(4.11) Ra(t) = = T(3(n - a) (0 <Rea<n), 


(fa)2ex0/?|z\n-2 


so that 
sear rate Rs: 


The Heat Operator 151 


Rg is called the Riesz potential of order a. In particular, when a = 2 
(and n > 2) we see that 


T(in -1) = T(4n) - 1 


Ryle) = Ann/2|¢|n—-2 ~ (n — 2)20"/2|2jn-2 — (n — 2), [2|"-2? 


which is the fundamental solution for —A given by (2.18). 

We have thus rederived the fundamental solution for A by Fourier anal- 
ysis, at least when n > 2. When n = 2 this procedure breaks down: as 
a —+ 2, the coefficient ['(3(2 — @)) in Re(x) becomes infinite, and the 
function (47?|€|?)-%/? ceases to be integrable at the origin. However, one 
can proceed as follows. When 0 < a < 2, the Riesz potential Ra satisfies 
(—A)*/? Ro = 6. But so does Ry — cy for any constant cg, and we can 
choose Cg so as to obtain a finite limit as a — 2. Namely, let 


_ RG = ga). TG ae 
(4.12) Qa(z) = Ra(z) Tian Pte iat Lm | ~1). 

Since (1 — da) = 2T(2 — ta)/(2 — a), an application of |’H6pital’s rule 
shows that 


Q2(r) = jim Qa(z) = — = log lal, 


which is the fundamental solution for —A. Roughly speaking, Qo is ob- 
tained from Re by subtracting off an infinite constant. Correspondingly, 
its Fourier transform is obtained from (4m?|£|?)—! by subtracting off an 
infinite multiple of 6; see Exercise 2. 

Our next example is a modification of the preceding one. It is sometimes 
convenient to consider powers of I — A rather than —A so as to avoid 
confronting the singularity of |€|~* at € = 0. Accordingly, we observe that 


(1+ s)7? = =i f © ponlen(144)r dr (Ref > 0), 


so that the convolution kernel of (I — A)~* should be 


1 ae eee 
raf ro-le-T K(x, r) dr. 


Again we set @ = $a and define 


1 bs 2 
[ pia—n)/2-1,-r-|z| [ar dr 


. B., =e 
(4.13) (z) Tga\lary? Je 


152. Chapter 4 


for Rea > 0. (The integral converges for allz #0.) Bz is called the Bessel 
potential of order a. (The name comes from the fact that B, is a Bessel 
function; in fact, Ba(z) is a constant multiple of jx\o-/27 Kay sa(Izl), 
where K, is the modified Bessel function of the second kind of order p.) 
Bg is in L1(R"), for 


[Balla < ries Me Gama ee aC dx dr 


pRe(a/2)-1 6-7 ge = T(; Rea) 
Sak ar= "ral 


Because of this, it is easy to use Fubini’s theorem to show that the Fourier 
transform of Bg is indeed (1 + 4?|¢|?)~°/2 (Exercise 3), so 


(I-A)7?*f=f*Ba (Rea >0). 


We shall meet this operator again in Chapter 6. 

For our final example, we rederive the solution of the Dirichlet problem 
for the Laplacian in a half-space (see §2G) by Fourier analysis. We recall 
that the problem is to solve 


O?u+ Aru =0 fort >0, u(z,0) = f(z). 


As in §4A, we apply the Fourier transform in x to obtain an ordinary 
differential equation in ft: 


07% — 4n7{€\?G = 0 for t > 0, aE, 0) = f(g). 


The general solution of this differential equation is a linear combination of 
e27lélt and e727 lit, We reject e2*!€l! since it is not tempered for t > 0, and 
the initial condition then yields 


G(é,t) = e271" Fle), or u(x,t) = e7*¥—4 f(z). 


We therefore have u(x,t) = f * P,(x) where P; is the inverse Fourier trans- 
form of e~?*1ét| The magic formula that enables us to compute this is as 
follows. 


(4.14) Lemma. 
If B20, 
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Proof: A standard application of the residue theorem yields 


°° Bs 
eA = =| ——, ds, 
-~o l+s 


and obviously 


1 = - —(l+s?)r g 
ya a Pia . 


Hence, by Fubini’s theorem and Theorem (0.25), 


Ve fh per ts 2 
= ~ [ te e's e~T? 6-7 dsdr 
=2[" ie e27Be . —4rx70? e-’ dodr 


: e7F 7/47, =r 
-[ 3 Tae dr. | 


Now if we set § = t,/s in Lemma (4.14), we obtain a formula of the 


form (4.10), - ae 
~tJ/s _ —t7s/4r 
e€ = [ ar dr, 


so the inverse Fourier transform P; of e~?7lélt should be given by 


Pi(z) = [ Fak (a,th/Ar) ar. 


To verify this and evaluate the integral, simply take @ = 27|é|t in Lemma 
(4.14) and use Fubini’s theorem and Theorem (0.25): 


Pa) = | hn. eM IPOH de dr 


ev lt 2/42 
=[-< Var =(3)"" a 


eee (n-1)/2, —r(it|z}?/t?) 
=e [ T dr 


] oo 
oe (n-1)/2,-0 
- aT EEO w eras 


~ M(n+1)/2) é 
~ (nm $1)/2 (t? + |x|2)@+1)/2 si 


Since T'((n + 1)/2)/et0/? = 2/un41, we have recovered the formula 
(2.43) for the Poisson kernel, and we have proved: 
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(4.15) Theorem. 
If P, is the Poisson kernel defined by (2.43), then P,(€) = e~2*#lél, 


This result gives an easy new proof of the semigroup property Pky, = 
P, * P,, for it is obvious that Piss = PB. It also makes clear how the 
Dirichlet and Neumann data of the harmonic function u({z,t) = f * P,(x) 
determine each other. Indeed, we have u(-,0) = f, and 


[—OuJ(E,0) = 2alélen2*€l* F(e)|_, = Qmlel F(E). 


That is, if g = —d,u(-,0) then g = (—A,)'/?f; in other words, f = 
(—A,)~1/2g = g * Ry where R, is the Riesz potential given by (4.11). 


EXERCISES 
1. By Theorems (0.25) and (0.26), for any ¢ € S(IR”) and r > 0 we have 


Perl Geyan = 0? ferret. 


For 0 < Rea <n, multiply both sides by r("-%)/2-1 dr and integrate 
from 0 to oo to obtain 


we / [n|°-"4(2) dx = HO) / lel" *4(€) ae. 


Conclude that if Ry is given by (4.11), then Ro(€) = (Qn|é|)~%, the 


Fourier transform being interpreted in the distibutional sense. 


2. From Exercise 1 and the final paragraph of §0B, if Qa (0 < a < 2) is 
given on IR? by (4.12) we have 


T(1— 3a) 


Qa(é) = (2x|é|)~% - T(layoem 
2 


6(é). 


Show that 

Ta- 3a) - i dé 

T(ga)2em Sige (2zl€l)* 
approaches a finite limit c as a approaches 2 from below, and conclude 
that the Fourier transform of Q2(z) = (~27)~! log |z| is the distribution 
given by 


~ fo) — 4(0)-€-V4(0) o(€) 
ecg je Ge a (arly? 


dé — c9(0). 
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(The first integral is absolutely convergent because the numerator is 
O(|é|?) by Taylor’s theorem. The term & - V6(0)/(27|€[)? is really a 
phantom, since it is an odd function and so its integral over the unit ball 
vanishes formally. Thus, intuitively we have Q2(€) = (2n|€|)-? — C6(é) 
where C is the infinite constant fj. .,(2m|é|)~? dé +c.) 


3. Show that if Rea > 0 and Ba is given by (4.13), then By(é) = 
(1+ angi) 2/2, 


C. The Heat Equation in Bounded Domains 


If one wishes to study heat flow in a bounded region of space 2 C R” 

over a time interval 0 < ¢ < T < oo, it is appropriate not only to specify 

the initial temperature u(z,0) (x € 2) but also to prescribe a boundary 
condition on @Q x [0,7]. For example: 
i. u = g (the temperature on the boundary is specified); 

ii. O,u = 0 (the boundary is insulated; there is no heat flow into or out of 
Q); 

iii. 0,u+¢c(u—uo) = 0 (Newton’s law of cooling: outside Q the temperature 
is maintained at uo, and the rate of heat flow across the boundary is 
proportional to u — uo). 

The first basic result. concerning such problems is the following maximum 

principle: 


(4.16) Theorem. 

Let Q be a bounded domain in R® and 0 < T < co. Suppose u is a real- 
valued continuous function on Q x [0,T] that satisfies 0;u— Au = 0 on 
Q x (0,7) (and hence is C® there). Then u assumes its maximum either 
on 2 x {0} or on 02 x [0,7]. 


Proof: Given « > 0, set v(z,t) = u(z,t) + €|z|?. Then Q0 — Av = 
~2ne < 0. Suppose 0 < 7’ < T. If the maximum value of v on © x (0,7) 
occurs at an interior point, the first derivatives of v vanish there and the 
pure second derivatives aFu are nonpositive. In particular, 0:0 = 0 and 
Av <0, which contradicts Ou — Av < 0. Likewise, if the maximum occurs 
on Q x {T"}, the t-derivative must be nonnegative and the pure second 
z-derivatives must be nonpositive, so Gv > 0 and Av < 0, which again 
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contradicts 0,v — Av < 0. Therefore, 


max u< max v= max v 
Ax[o,7'] Axor} (x {0})U(aNx[0,T'}) 
< max u-+emax|z|?. 
(Ax {0})u(anx[0,7}) a 


Letting « — 0 and T’ — T, we obtain the desired result. ] 


Replacing u by —u, we see that the minimum is also achieved on either 
Q x {0} or OQ x [0, T}. Therefore: 


(4.17) Corollary. 

There is at most one continuous function u on Q x [0, T] which agrees with 
a given continuous function on Q x {0} and on 8Q x [0,7] and satisfies 
Ou — Au =0 0n Q x (0,7). 


(In particular, note that the “Dirichlet problem” is overdetermined: the 
boundary values on Q x {T} cannot be specified in advance.) 

Let us look more closely at the problem obtained by holding the bound- 
ary at a constant temperature, which we may assume to be zero by adjust- 
ing our temperature scale: 


Qu — Au =0 on Q x (0,00), 


(4.18) u(z,0)= f(z) (rEQ), u(z,t)=0 (2 EAN, t E (0,00). 


This problem can be solved by the method of separation of variables. That 
is, we begin by looking for solutions of the differential equation of the form 
u(x,t) = F(x)G(t). For such a function we have 0,u—Au = FG'-(AF)G, 
so the heat equation implies that G'/G = (AF)/F. But G'/G isa function 
of t alone, and (AF)/F is a function of x alone, so both these quantities 
must be some constant . Thus G! = \G, so G(t) = Ce", and AF = AF. 

Suppose we can find an orthonormal basis {Fj} for L?(Q) such that 
AF; = 4;F; and Fj(z) = 0 for x € OQ. We can then solve (4.18) by 
expanding f in terms of the F,’s, say f = }>a;F;, and taking 


(4.19) u(z,t) = So aj Fj (z)e™*. 


Provided that the ;’s are nonpositive, this series will converge in L?(Q) 
for each t > 0 and will satisfy u(x,t) = 0 for z € A as best it can. 
(For example, if f € H?(Q) as defined in §2F, then u will vanish on 69 x 
(0,00) in the sense of Corollary (2.40).) Also, one easily verifies that u is 
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a distribution solution of the heat equation (Exercise 1), hence a classical 
solution by Corollary (4.7). 

The problem (4.18) is therefore reduced to finding an eigenbasis for the 
Dirichlet problem for the Laplacian. We have done this in §3H when 22 is 
a ball. We will show in §7E that such an eigenbasis exists for a general 
bounded domain Q with smooth boundary and that the eigenvalues are all 
negative. 

Similar considerations apply to the insulated-boundary problem 


Ou —- Au=0 on 2 x (0,00), 
u(z,0)= f(z) (2 €Q), O,u(z,t)=0 (zx € AQ, t E (0,00)). 


This problem boils down to finding an orthonormal basis {H;} for L?(@) 
such that AH; = yj Hj (uj; < 0) and 0,H; = 0 on OQ. Again, we have 
seen how to do this when Q is a ball, and we will prove the existence of 
such an eigenbasis for a general Q in §7E. 

Finally, we solve the initial value problem for the heat equation on 
the circle (physically: an insulated loop made of homogeneous material of 
negligible diameter). We think of the circle as the set of complex numbers 
of modulus one and identify it with the unit interval by the correspondence 
x +e?" We then wish to solve the problem 


O,u — O2u=0 fort > 0, 


2 
on u(z,0)= fle), w(e-+1,t) = u(t), 


where f(z +1) = f(z). The natural tool here is Fourier series, i.e., expan- 
sions with respect to the orthonormal basis {e?"#*7}*, for L?(0,1). (One 
would be led to this in any event by separation of variables.) Indeed, if we 
look for a solution of (4.20) in the form u(x,t) = 0°, Ck(t)e?****, the 
heat equation implies that Ci(t) = —4a?k?C;,(t), and the initial condition 
means that C;(0) must be the kth Fourier coefficient f(k) of f. Therefore 
Cy(t) = F(k)e47"*"*, and 


CO co 1 
u(z,t) = Se fee niente = | Ply)e2tE(e—v)—4n74*t dy 
=e < Jo 


= [f+ %.0]@), 
where . 
V(z,t)= > e2rike—4n7k7t 
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and * means convolution on the circle group R/Z. (The sums are rapidly 
convergent for ¢ > 0 because of the factors e~4* *”*, so the formal manipu- 
lations are justified. In particular, J and u are C™ functions for t > 0 and 
satisfy the heat equation.) 

The function J therefore plays the same role for the heat equation on 
the circle as the Gaussian kernel K does on R®. Like K, J has a signif- 
icance which reaches far beyond the study of heat flow: it is essentially 
one of the Jacobi theta functions, which have deep connections with ellip- 
tic functions and number theory. (More precisely, in the notation of the 
Bateman Manuscript Project [4], J(z,t) = @3(x|47it).) 


EXERCISES 


1. Suppose { F;} is an orthonormal basis for L?(Q), A; < 0, and Y |a;|? < 
CO. 
a. Show that the series (4.19) converges in the topology of distributions 
on 2 x (0,00). 
b. Show that if AF; = A; Fj, the series (4.19) satisfies the heat equa- 
tion on Q x (0,00) in the sense of distributions. (This is almost 
immediate from part (a).) 


2. Suppose {Fj} is an orthonormal basis for L?(Q) such that AF; = ; F; 
(Aj < 0) and F; = 0 on @Q. Solve the inhomogeneous heat equation 


Ou ~ Au = g(x, t) on Q x (0,00), 
u(z,0)=0 (rEQ), u(z,t)=0 (x € AQ, t € (0,00)) 


in terms of the basis {Fj}. (The solution will look like (4.19), but with 
e*3* replaced by the solution of an inhomogeneous ordinary differential 
equation.) 


3. Suppose Q is a bounded domain with C! boundary, and suppose u is 
a C! function on Q x (0,T) that satisfies Q:u — Au = 0 on 2 x (0,T) 
and either of the boundary conditions u = 0 or 0,u = 0 on 00 x (0,T). 
Show that fy, |u(zx, ¢)|? dz is a decreasing function of t. (Hint: Observe 
that u(du— Au) = $6;,(|ul?) — V - (@Vu) + |Vul? and integrate over 
2.) 


Chapter 5 
THE WAVE OPERATOR 


The last of the three great second-order operators is the wave operator 
or d’Alembertian 


n 
ap -A= 07-5 8} 
1 
on IR" x R. As the name suggests, the wave equation 
a7u— Au=0 


is satisfied by waves with unit speed of propagation in homogeneous iso- 
tropic media. (Actually, in most cases such as water waves or vibrating 
strings or membranes, the wave equation gives only an approximation to 
the correct physics that is valid for vibrations of small amplitude. However, 
it is an easy consequence of Maxwell’s equations that the wave equation is 
satisfied exactly by the components of the classical electromagnetic field in 
a vacuum.) The characteristic variety of the wave operator, 


x= {(é,7) # (0,0)ER"xR:7r?= lél?}, 


plays an important role in the theory. It is called the light cone, and the 
two nappes {(€,7T) € L: r > 0} and {(€,7r) € U: r < O} are called the 
forward and backward light cones. 

The wave operator is the prototype of the class hyperbolic operators. 
(There are several related definitions of hyperbolicity for general differen- 
tial operators, of which perhaps the most widely useful is the following. An 
operator L = }paiaj<k qj (2, t)020! on R" x R is called strictly hyper- 
bolic if, for every (z,t) € IR" xR and every nonzero € € R", the polynomial 
P(r) = Viatejce aq(z,t)€%r? has k distinct real roots.) There is an ex- 
tensive literature on hyperbolic equations, of which we mention only a few 
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basic references: John [30] for a brief introduction; Courant and Hilbert 
{10], ormander [26], [27, vol. II], John [29], and Bers, John, and Schechter 
[7] for accounts of various aspects of the classical theory; and Garding [22], 
Treves (53, vol.II], Hormander [27, vols. III and IV], and Taylor [48] for 
more recent developments using the machinery of pseudodifferential oper- 
ators and Fourier integral operators. 


A. The Cauchy Problem 


The basic boundary value problem for the wave equation is the Cauchy 
problem. We know from our analysis in §1C that the initial hypersurface S 
should be non-characteristic in order to produce reasonable results. How- 
ever, for n > 2 this condition is not sufficient. Indeed, recall the Hadamard 
example 
u(x,,22) = e~V*(sin kx1)(sinh kr) 

which shows that the Cauchy problem for A in R? on the line x2 = 0 is 
badly behaved. (See (1.46).) If we think of u as a function on R™xR (n > 2) 
that is independent of z3,..., 2, and ¢, then u satisfies 02?u — Au = 0 and 
the initial conditions 


u(z1,0,23,...,%n,t} = 0, O2u(21,0,23,...,%n,t) = ke7¥* sin kr, 


on the non-characteristic hyperplane zz = 0. As k —+ 00, the Cauchy data 
tend uniformly to zero along with all their derivatives, but u blows up for 
rq # 0. 

We can generalize this example. Let S be the hyperplane v’-x+vot = 0 
through the origin with normal vector 


v= (v',v¥o) = (4,..-,Un, Vo). 
Suppose there is a vector (ji1,..-, Hn, Ho) = (#’, to) such that 
(5.1) f(z,t) = exp[i(p! - 2 + pot) + uv’ - x + vot] 
satisfies 0? f - Af = 0. Since 6? — A is real, the imaginary part 
sin(p’ -z + pot) exp(v’ - x + vot) 


also satisfies the wave equation. Moreover, since the wave operator is in- 
variant under the transformation (x,t) — (—z,—t), the even part 


sin(p’ - x + pot) sinh(v’ - x + vot) 
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still satisfies the wave equation. Finally, for any k > 0, 
u(z,t) = e7V* sin k(p’ - + pot) sinh k(v’ - x + vot) 
will satisfy the wave equation with Cauchy data 
u(z,t) =0, Oyu(z,t) = ke~V* sin k(y! «2 + pot) 


on S. As k — oo we obtain the same pathology as before. So when can 
this happen? 


(5.2) Proposition. 
There exists (p', 0) # (0,0) € IR" x R such that (5.1) satisfies the wave 
equation if and only if 
i, Up = +My (ie., the line vot + vx = 0 is characteristic), ifn = 1; 
ii. [vo| < |v'|, ifn > 2. 


Proof: (5.1) satisfies the wave equation precisely when 


(Yo + ito)? — SO (5 + in)? = 0. 
7 


ifn = 1, this just means that vp + ivo = (41 + tpy), ie., that vp = +, 
and po = +y1. If n > 2, we take real and imaginary parts: 


ve~po- |‘? +14? =0, pov pv’ = 0. 


In case vp = 0 (so in particular |vo| < |v’|), we can choose fp = 0 and 
p’ any n-vector of length |v’| perpendicular to v’. If vo # 0, we have 
Ho = Yo pv, so 

v9 — [e‘|? = 9 — [HP = v9 2! ev")? — fu’? 


<7 ele’? — |e P = vw’ P(e‘? — va). 
This forces v2 ~ |v’|? < 0, i-e., [vol < [v’|. On the other hand, if this 


condition is satisfied, we can take fo = 1 and y’ any vector of length 
(1 + |v}? — v2)!/? making an angle with v’ equal to 


Y 


arccos er - 
|v!|( + |e? — 95)? 


(This works since the argument of arccos lies in [~1, 1].) 1 
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This leads to the following definition. The hypersurface S in R" x R 
is called space-like if its normal vector v = (y',v) satisfies |vo| > |v’ 
at every point of S, that is, if v lies inside the light cone. The preceding 
argument suggests that for n > 2, the Cauchy problem on S will be well 
behaved if and only if S is space-like, and this turns out to be so. We shall 
restrict our attention to the most important special case, where S is the 
hyperplane ¢ = 0, and make some remarks about more general S at the 
end of the section. 

We remark to begin with that the wave operator (unlike the heat oper- 
ator) is invariant under time reversal (x,t) — (z,—t). It therefore suffices 
to consider solutions in the half-space t > 0, as similar results may be 
obtained for t < 0 be replacing t by —t. Our first result is a uniqueness 
theorem. 


(5.3) Theorem. 
Suppose u(z,t) is C? in the strip 0 <t < T and that 0?—Au= 0. Suppose 
also that u = 0,u = 0 on the ball 


B= {(x,0): |x — zo| < to} 


in the hyperplane t = 0, where zo € R" and 0 < to < T. Then u vanishes 
in the region 


Q= {(x,t):0<t < to and |[z — zol < to — t}. 
(Note that 2 is the [truncated] cone with base B and vertex (zo, to), 


or in other words, the region in the strip 0 < t < to that is inside the 
backward light cone with vertex at (zo, to). See Figure 5.1.) 


Figure 5.1. The regions in Theorem (5.3). 
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Proof: By considering real and imaginary parts, we may assume that 
u is real. For 0 <¢ < to, let 


B, = {z: |z—z0| < to — t}. 


We consider the integral 


E(t) = if [Ve,cu(2, t)|[? dz = af (4) +>> (#)| dz, 


which represents the energy of the wave in the region B; at time t. The 
rate of change of E(t) is 


dE du Ou du du ' ‘ 
dt Jp, Ee ov + 29a; Ox; Ox; Ot eat) 4 ae Te atilne: 


(The second term comes from the change in the region B;; see Exercise 1.) 
Now we observe that 


50) OU Oh, 2 ee 
Gz; |dx; dt | ~ Oz; dajdt * du? dt’ 


so by the divergence theorem, 
i du Pu pee du 6?u a] ¢ 
B, | Ot at? Ox; Ox; At 
Ou | 07u u du Ou 
=f ot Ea] at f Oe at Oz; 82; 


where v is the normal to B, in R®. The first integrand on the right vanishes, 
and for the second we have the estimate 


Ou Ou Ou au [?\1? 
B32," <|3¢| (loss ) 
+ (|) Ou *) 1 4 
<3 (SP IA) = piven, 
2\t at y Oz; 2 ; 


by the Schwarz inequality and the fact that 2ab < a? + b?. Therefore, 


dE ( Ou Ou 
aB 


pecanie eae hoeopasoalery meee § 2 
ai dt dz, ie ERD | ) do <0. 


But clearly E > 0, and #(0) = 0 because the Cauchy data vanish. Hence 
E(t) = 0, so Venu = 0 0n N= {(2,t): 2 € By}. But u(z,0) =0,sou=0 
on 2. | 
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This is a very strong result. It shows that the value of a solution u 
of the wave equation at a point (ro,t9) depends only on the Cauchy data 
of u on the ball {x : [x — xo| < to} cut out of the initial hyperplane by 
the backward light cone with vertex at (z9,to). (This expresses the fact 
that waves propagate with unit speed.) Conversely, the Cauchy data on 
a region RF in the initial hyperplane influence only those points inside the 
forward light cones issuing from points of R. 

Similar results hold when the hyperplane ¢ = 0 is replaced by a space- 
like hypersurface S = {(x,t) : t = ¢(z)}. (The condition that S is space- 
like means precisely that |V¢| < 1.) Indeed, the change of variable t/ = 
t — $(z) transforms the Cauchy problem for the wave equation on S to the 
Cauchy problem for another differential equation Lu = 0 on the hyperplane 
t! = 0. The fact that S is space-like guarantees that LD is still strictly 
hyperbolic, and one can apply the theory of general hyperbolic operators; 
cf. the references in the introduction to this chapter. See also Exercise 3 
for the case where S is a hyperplane. 


EXERCISES 


1. Show that if f is a continuous function on R® and z € R”, 


d 
af foav= [Mode 


2. Suppose u is a C? solution of the wave equation in IR" x IR. Show that if 
u(-, 49) has compact support in R” for some tp then u(-,t) has compact 
support in IR" for all t, and adapt the proof of Theorem (5.3) to show 
that the energy integral E = f,.|Vz,ul? dz is independent of t. 


3. Suppose v = (v’, vp) is a unit vector in R" x R with |v'| < |vol, so that 
the hyperplane S = {(z,t):v'-2 + vot = 0} is space-like. 
a. Show that there is a linear transformation of IR” x R that maps S 
onto the hyperplane ¢ = 0 and has the form T = 7)7, with 
Ti(z,t) = (Ra, t), where R is a rotation of R"; 
To{(z,t) = (z},22,...,2n,t’), where 
zi = 2,cosh@+tsinh@, t' = 2, sinh 6 + tcoshé (8 € R). 
b. Use Theorem (2 1) and Exercise 2 of §2A to show that if T is as in 
part (a) then (0? — A)(uoT) = [(6? — A)uJ oT. 
c. Conclude that the Cauchy problem for the hyperplane S can be 
reduced to the Cauchy problem for the hyperplane ¢ = 0 by com- 
position with the transformation T. 
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B. Solution of the Cauchy Problem 


In this section we shall construct the solution of the Cauchy problem 


é2u~ Au=0, 


(5.4) u(z, 0) = f(z), O,u(z,0) = g(z). 


We start with the one-dimensional case, which is very simple. First, 
we observe that if ¢ is an arbitrary locally integrable function of one real 
variable, the functions us(2z,t) = ¢(z +1) satisfy the wave equation, for 
OPus = O2us = $(2 +t). (More precisely, if ¢ is C? then uz is a classical 
solution of the wave equation, while if ¢ is merely locally integrable, u is 
a distribution solution.) Conversely, it is easy to see — at least on the 
formal level — that any solution of the one-dimensional wave equation is 
of the form ¢(x +t) +(x —t) where ¢ and w are functions of one variable. 
Indeed, if we make the change of variables € = r+t, 7 = 2-1, the chain 
tule gives 0, = O¢ + 0, and & = O¢ — ,, so that 0? — 02 = —40¢0,, and 
the wave equation becomes 0¢0,u = 0. To solve this, we integrate in 7, 
obtaining Ogu = (€), where @ is an arbitrary function, and then integrate 
in €, obtaining u = $(€) + Y(7) where ¢’ = © and y is again arbitrary. 

With this in mind, it is easy to solve the Cauchy problem (5.4) when 
n= 1. We look for a solution of the form u(z,t) = ¢(2 +t) + ¥(z — 1%). 
Setting ¢ = 0, we must have ¢+y = f and ¢/—y’ = gq. Hence ¢’ = 3(f’ +9) 
and yy’ = a(f —g), so 


o(z) = 4 f(a) +4 / a(s)ds+Ci, Wa) = A(z) -4 is " g(s)de+ Cn, 
oO 0 


where C; + Cz = 0 since 6 + » = f. Therefore, 


r+t 
(5.5) u(z,t) = terns se-ol+4 [ ; g(s) ds. 
e- 

It is a simple exercise to verify that this formula really works. To be precise, 
if f is C? and g is C1 on R", then u is C? on IR” x R and satisfies (5.4) 
in the classical sense; if f and g are merely locally integrable, u satisfies 
the wave equation in the sense of distributions and the initial conditions 
pointwise. In fact, one can take f and g to be arbitrary distributions on 
R; ES g(s)ds is then to be interpreted as the distribution g * x, where x 


is the characteristic function of [—t,t]. We leave the details to the reader 
(Exercise 1) and summarize our results briefly: 
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(5.6) Theorem. 
Ifn = 1, the solution of the Cauchy problem (5.4) is given by (5.5). 


The situation in space dimensions n > 1 is a good deal more subtle. 
We shall construct the solution when n is odd by using a clever device to 
reduce the problem to the one-dimensional case, and then obtain the even- 
dimensional solutions by modifying the odd-dimensional ones. As in the 
one-dimensional case, we shall first proceed on the classical level, assuming 
that all functions in question have lots of derivatives, and then observe that 
the results also work in the setting of distributions. 

If ¢ is a continuous function on R", z € R", and r > 0, we define the 
spherical mean M,(z,r) to be the average value of ¢ on S,(z): 


1 
M4(z,r) = ra ee $(z) do(z). 
The substitution z = 2 +ry turns this onto 
1 
(5.7) Mg(z,r) = — o(z + ry) do(y), 
Wn Syf=1 


which makes sense for all r € R. Accordingly, we regard My as the function 
on IR” x R defined by (5.7). As such, it is even in r, as one sees by making 
the substitution y — —y, and it is C* in both x and r if ¢ is C*, as one 
sees by differentiating under the integral. Moreover, My(-,0) = ¢. 

Mg satisfies an interesting differential equation which may be derived 
as follows. If T is any rotation of R", by Theorem (2.1) we have 


A,[6(z + Ty)] = [A¢|(z + Ty) = Ay[o(z + Ty)], 


where A, and Ay denote the Laplacian acting in the variables z and y. 
We now average these quantities over all rotations T. Since the average of 
o(z + Ty) over all rotations is M4(z, |y|), we obtain 


A,Mg(z, ll) = Ay Mg(z, lyl). 
Therefore, by Proposition (2.2), 


n-1 


(5.8) A.Mg(z,r) = [et + a,| Mg(z,r). 

To make this argument complete we should explain carefully what it means 
to average over all rotations; instead, we shall give an alternative derivation 
that finesses this problem. 
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(5.9) Proposition. 
If $ is a C? function on R", then Mg satisfies (5.8) on R” x R. 


Proof: It suffices to consider r > 0, since both sides of (5.8) are even 
functions of r. First, by (0.1) and the divergence theorem, 


1 
a.Mol2.r)= [So md d(e+ rv) dol) 
nm y = 
= en rAd¢(z + ry) dy 
Wn Jlyf<1 
es est Ad¢(a + z) dz. 


md 
mln Jal<r 


Multiplying by r"—! and expressing the integral in polar coordinates, we 
obtain 


1 r 
m1a,Moer) = [fade + eu)e"™! doy) dp, 
“n Jo J\y|=1 
so 
8, [r°-10, Mg(2, r)] = + Ag(a + ry)r"—! do(y) = r®7 "Ar Mg(z,r). 
n Jlyf=1 
The desired result follows by working out the derivative on the left and 


dividing by r?7?. i] 


(5.10) Corollary. 

Suppose u(x,t) is a C? function on R" x R, and let M,(x,7,t) denote 
the spherical mean of the function x — u(z,t). Then u satisfies the wave 
equation if and only if M,, satisfies 


(5.11) E a m4] M, (2, r,t) = 0? Mu(z, r,t) 
r 


for each zx € R". 


Proof: We have merely to apply Proposition (5.9) after observing 
that A,My = Mau and 07My = Mazu. i 


When n is odd, the differential equation (5.11) can be reduced to the 
one-dimensional wave equation by means of the following identity. 
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(5.12) Lemma. 
Ifk > 1 and ¢ € C*t1(R), then 


(5.13) 82 (r7 19,)F-lp2k-lg(y)] ae (r718,)¥{r?* '(r)]. 


Proof: It is easy to verify directly that (5.13) holds when ¢(r) = r™ 
and hence when ¢ is any polynomial, and that both sides of (5.13) vanish at 
ro when ¢ and its derivatives of order < k+1 vanish at ro. But by Taylor’s 
theorem, for any rg we can write ¢ = P + R where P is a polynomial and 
R vanishes to order k + 1 at ro, so (5.13) follows in general. | 


The right side of (5.13) equals 
(771, JF 1p? 1g !"(p) a Qkr?*-24'(r)}, 
so if we define the differential operator T;, by 


Te@(r) = (r778,)*-[r7F-*6(r)], 


(5.13) says that 
OT = T, (02 + 2kr-10,)¢]. 


Thus, ifn = 2k +1, 7, intertwines the operator occurring on the left of 
(5.11) with 0?, so applying 7; to both sides of (5.11) converts (5.11) into 
the one-dimensional wave equation. 

For future reference we make one more observation about the operator 
Ty. In Td there are 2k — 1 powers of r in the numerator and k — 1 in 
the denominator, and k — 1 derivatives. Expand 7,.¢ by the product rule: 
if j derivatives hit ¢, then k — 1—j derivatives must hit the powers of r, 
leaving a factor of r to the power (2k — 1) — (k — 1) —(k-1—j) =j 41. 
Therefore, 


k-1 


T(r) = o crt g@(p), where 


0 
cor = (r7'0, E71 n2k#-1 = 1.3---(2k — 1)r. 


(5.14) 


At last we are in a position to solve the Cauchy problem (5.4) when 
the space dimension n is odd and > 1. We shall start by assuming that a 
solution exists and deriving a formula for it, and then we shall show that 
the formula always gives a solution. Suppose, then, that u satisfies (5.4), 
and suppose for the moment that u, f, and g are smooth — at least of 
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class C("+3)/2_ By Corollary (5.10), the spherical mean M,, satisfies the 
differential equation (5.11) with initial conditions 


M,(2,r,0) = M;(z,r), &,Mu(2,r, 0) = M,(z,r). 


Thus, if we set 
u=TM,, f=TM;, g=TM,, 


where 


T(-) = Ten-aypa-) = (7728, ) O79) f"-7(.)] 


the remarks following Lemma (5.12) show that 


07 7i(2, r,t) = 02%(z, r,t), 


U(z,r,0)= f(z,r), U(x, r,0) = H(z, r). 
The solution to this problem is given by (5.5), 


= r+t 
U(2z,r,t) = ‘Al (z,r+t)+ f(z,r—t)} + i/ 9(z,s) ds, 
rot 


so it remains to recover u from u. In principle, M, is obtained from @ 
by undoing the operator T, i.e., by successive integrations, and then u is 
obtained from M, by setting r = 0. However, one can take a shortcut by 
using (5.14) with ¢(r) = Mu(z,r,t) and k = $(n— 1): 

. w t 

u(z,t) = M,(z2,0,t) = lim Men 

r—0 cor 

where co = 1-3---(m — 2). Moreover, since My and Mg are even functions 


of r, f and g are odd, and 8, f is even; hence, by I’H6pital’s rule, 


1 os sas rtt 
u(z,t) = lim Dae [Fe. r+t)+f(z,r—-t)+ de g(x, s) as 


= 5 [ODE Mart @Ale |. +H -H2,-9] 


= = [OAenlo.+He.9). 


If we unravel the definitions of f. g, and co, we obtain the promised formula 
for u in terms of f and g, which we state explicitly in the following theorem. 
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(5.15) Theorem. 

Suppose n is odd and n > 3. If f € C+3)/2(R") and g € C("+1)/2(R7), 
the function 

(5.16) 


u(x,t) = 1-3-- a — wn Jacetano-o aye F(z+ty) do) 


+ (t718,)0-9)/2 (ae 
y 


solves the Cauchy problem (5.4). 


; g(x + ty) in(v)| 


Proof: Up to a constant factor, the second term on the right of (5.16) 
is 
u(x,t) = (t778,)-9/7t"-? M, (z, t)], 
and by Corollary (5.10) and Lemma (5.12) we have 
Azv(x,t) = (£720) 9)/7[t"-7A, M (z, t)] 
= (t710,)~-9/2[1"-29? M, (2, t) + (n — 1)t"-70, M,(z, t)] 
= (1718,)- 2" 1a, M, (2, t)) 
= a2(t719,)"-9)/2[t"-2.M, (2, t)] 
= d?v(z, t), 
so this term satifies the wave equation. Likewise, the function 


w(z,t) = (t71)~ 9/2 [4"- 2M, (2, t)] 


satisfies the wave equation, and hence so does 0;w(z,t), which is the first 
term on the right of (5.16). As for the initial conditions, by (5.14) we have 


u(a,t) = [eaty(e,) + ai t’OMy (2,1) + ot)| +1tM,(z,t) + O(t?) 


= My(2,t)+ _ sod tO, My (2, t) + tM,(a,t) + O(2?). 
0 
Hence 
u(x, 0) = M;(z,0) = f(z), 
and 3 
@ru(z,0) = AA) 9,144 (2,0) + My(2,0) = gle) 


because M;(z,t) is even in t and so its derivative vanishes at ¢ = 0. The 
proof is complete. | 
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The solution of the Cauchy problem (5.4) for even n is readily derived 
from the solution for odd n by the “method of descent.” This is just the 
trivial observation that if u is a solution of the wave equation in R"+! x R 
that does not depend on xn 44, then u satisfies the wave equation in R° xR. 
Thus to solve (5.4) in R” x R with n even, we think of f and g as functions 
on R"*! which are independent of 2,41, write down the solution (5.16) 
of the Cauchy problem, and check that it does not depend on tn41. The 
result is as follows. 


(5.17) Theorem. 

Suppose n is even. If f € C(r+4)/2(R") andg € Cn 42)/2(R"), the function 
(5.18) 

u(z,t)= aor | Hera? (er-} f(z + ty) ay) 


3++-(n—lwnga list 1 = [yl 


+ (t71a,)e-2/? (“ g(z + ty) ay) 
visa V1 — [yl? 


solves the Cauchy problem (5.4). 


Proof: Denote a point in R"+?! by (2, 2,41) where x = (24,...,2n). 
If we think of f and g as functions on R"+! that depend only on z, the 
solution to (5.4) in R"+! x IR is given by (5.16) with n replaced by n + 1, 
which involves the integral 


| ee ace 
vVirt¥ na =) 


and a similar integral with f replaced by g. We parametrize the upper and 
lower hemispheres of the sphere |y|? + y2,, = 1 by yn41 = +¢(y) where 
o(y) = V1—lyl?, ly| < 1. By a standard formula of advanced calculus, 
the element of surface area is given by 


dy 


V1 ~ ly?’ 


so we obtain (5.18). (The factor of 2 is there because we integrate over 
both the upper and lower hemispheres.) i] 


do(y, yngi) = V14+ (Vel? dy = 


The formulas (5.5), (5.16), and (5.18) give the solution to the Cauchy 
problem (5.4) in arbitrary dimensions. We note that these formulas agree 
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with the uniqueness theorem (5.3): for to > 0, the value of u at (x0, to) 
depends only on the values of f and g on the ball |x — xo| < tg cut out of 
the initial hyperplane by the backward light cone with vertex at (xo, to). 
Actually, when n is odd we can say more. If n = 1, u(tp,to} depends on 
the values of g on |x — xo] < to, but on the values of f only at ro = to. 
If n is odd and n > 3, u(zo, to) depends only on the values of f and g and 
their first few derivatives on the sphere |x — x9| = to — or, so to speak, 
on the values of f and g on an infinitesimal neighborhood of the sphere 
jz — to| = to. This fact is known as the Huygens principle, and we 
shall refer to the fact that it holds only in odd dimensions as the Huygens 
phenomenon. 

Physically, the Huygens phenomenon can be understood as follows. 
Suppose you are in a dark room at position 29, and someone at the origin 
sets off a flash bulb at time t = 0. At time to = |xo] you see a flash of light, 
then darkness again. In an even-dimensional world you would see a burst 
of light at time to, but instead of disappearing promptly it would gradually 
fade away. This effect can be observed in two-dimensional water waves by 
dropping a pebble into a pond and watching the ripples. 

We should say something about the role of the differentiability hy- 
potheses in Theorems (5.15) and (5.17). The conditions on f and g in 
these theorems are designed to ensure that u € C?, so that u is a classical 
solution of the wave equation. Additional smoothness of f and g clearly 
implies additional smoothness of u: to be precise, if f € CFt("-1)/2 and 
g € CE+("~3)/2 (n odd) or f € CHt/2) and g € Ch-!+("/2) (n even) 
then u € C*. On the other hand, if we assume weaker conditions on f 
and g and interpret the formulas (5.16) and (5.18) properly, we obtain 
distribution solutions of (5.4). 

In fact, for odd n > 3 let us observe that 


a Se +ty) daly) = f= ty) do(y) = f ¥Ei(2), 
Wn Siyj= lyj=a 


where ©, is the distribution 


(5.19) (Xt, %) = — p(ty) do(y). 


lyj=1 
(That is, Lz is surface measure on the sphere Sj,\(0), normalized so that 
the measure of Sj,)(0) is 1 and considered as a distribution on R".) The 
map t + LD; is a C® (in fact, analytic) function of t € R with values in the 
space of compactly supported distributions on R", and hence so is 
1 


(5.20) o, = TESTE. 


£70, \0 Di le Pay: 
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(@, is smooth even at ¢ = 0, by (5.14).) (5.16) then says that 
(5.21) u(.,t) = f * 0,4; + 9% D;. 


When rewritten this way, (5.16) makes sense when f and g are arbitrary 
distibutions on R” and defines u(-,¢) as a C™ function of ¢ with values in 
the space of distributions on R". (In particular, u may be regarded as a 
distribution on R" x R.) A standard approximation argument (consider 
f and g as limits of smooth functions) shows that u satisfies (5.4) in the 
appropriate sense. Similarly, when n is even, (5.21) holds with 


_ 1 =19 \(n—2)/2pyn—1 
O,= ieee” or) [t T,], 
(5.22) ; 


ty See oe 
ew) Wntl hues TF * 


In fact, the solution for n = 1 also has the form (5.21) where 
(5.23) (81,9) = if ¥(s) ds. 


Indeed, 3 fi W(s) ds = Y(t) + Y(-t), so 4:(s) = $[6(s — t) + 4(s + t)] 
when ©, is given by (5.23). 

The loss of continuous differentiability in passing from the Cauchy data 
to the solution in dimensions n > 2 is unavoidable. Intuitively, it happens 
because “weak” singularites in the initial data at different points will propa- 
gate along light rays and collide at later times, possibly creating “stronger” 
singularities. The remarkable thing, however, is that this can happen only 
to a limited extent. That is, although for any « > 0 there can be a loss of 
roughly in continuous derivatives in passing from u(z,0) to u(z,€), once 
these derivatives are lost there is no further loss in passing from u(z, €) to 
u(x,t) for t > e! Moreover, if one considers “LZ? derivatives” instead of 
“continuous derivatives,” there is no loss at all. We shall say more about 
this in §5D. 

Incidentally, one consequence of all this, which the reader may have 
noticed already, is that the wave operator is not hypoelliptic: Solutions of 
the homogeneous equation 0?u — Au = 0 can be arbitrarily rough. 


EXERCISES 


1. Show that if f and g are locally integrable functions on R, the function 
u(x,t) defined by (5.5) is a distribution solution of the one-dimensional 
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wave equation. In what sense do we have limi.ou(-,t) = f and 
limy—o O:u(-,f) = g? More generally, if f and g are distributions on 
IR, show how to interpret (5.5) as a smooth D’(IR)-valued function of t 
that satisfies (5.4) in an appropriate sense. 


2. The differential equation (5.11), i-e., (0? +(n—1)r7!6,]w = 6? wu, is the 
n-dimensional wave equation for radial functions (Proposition (2.2)). 
As explained in the text, when n is odd this equation can be reduced 
to the one-dimensional wave equation. In this problem we consider the 
case n = 3. 

a. Show that the general radial solution of the 3-dimensional wave 
equation is 


u(z,t)=r'[O(r+t)+o(r—t] (r=), 


where ¢ and 7 are functions on R. 
b. Solve the Cauchy problem with radial initial data (n = 3), 


O?u—Au=0, u(z,0) = f(\z|), Au(x,0) = g([z|), 


in a form similar to (5.5). (Hint: Extend f and g to be even func- 
tions on R.) 

c. Let u, f, g be as in part (b). Show that u(0,¢) = f(t)+tf' (t)+tg(t), 
so that u is generally no better than C) if f € C44 and g € CC). 


C. The Inhomogeneous Equation 


We now consider the Cauchy problem for the inhomogeneous wave equa- 
tion: 


Ou — Au= w(z,t), 


ia u(z,0)= f(z), du(z,0) = 92), 

which represents waves influenced by a driving force w(z,t). We know how 
to find a solution u; of this problem when w is replaced by 0. If we can 
also find a solution uz when f and g are replaced by 0, then u = uy + ug 
will be a solution of (5.24). But the latter problem is easily reduced to the 
former one by a version of the “variation of parameters” method known as 
Duhamel’s principle: 
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(5.25) Theorem. 
Suppose w € Cl"/?]+1(1R" x R). For each s € R let v(z,t;s) be the solution 
of 

jv — Av=0, v(x,0;s) = 0, 0,0(x, 0; s) = w(z, 8). 


Then u(z,t) = fo v(z, t — s; s) ds satisfies (5.24) with f =g = 0. 


Proof: Clearly u(z,0) = 0. Also, 
t t 
O,u(z,t) = v(z, 0; t) +f O,u(z,t— 5; s)ds= | Gjv(2, t — s; s) ds, 
ft) 0 
so 0,u(x,0) = 0. Differentiating once more in t, we see that 
t 
OF u(x,t) = v(x, 0;t) + [ O? u(x, t— s; s) ds 
0 
t 
= w(zx,t) +f Av(z, t — s; s)ds = w(z,t)+ Au(z,t), 
0 
so the proof is complete. ] 


The problem (5.24) is really of physical interest only for ¢ > 0. (If one 
considers t < 0, the Cauchy data f and g are “final conditions” rather than 
“initial conditions.”) If one wants to consider solutions of 0? — Au = w 
for arbitrary times, the following problem may be more natural. Suppose 
the system is completely at rest in the distant past, and then at some 
time to the driving force w(z,t) starts to operate. Thus, we assume that 
w(z,t) = 0 for t < to, and we wish to solve 0?u ~ Au = w subject to the 
condition that u(x,t) = 0 for t < to. This problem reduces immediately to 
Theorem (5.25) if we make the change of variable t + ¢ — to, but we can 
restate the solution in a way that does not mention the starting time to: 


(5.26) Theorem. 
Suppose w € Cl/21+1(1R" x R), and w(.,t) = 0 fort <0. Foreachs ER 
let u(x,t; 5) be the solution of 
av —Av=0, v(z,0;s) = 0, O,v(x,0; 8) = w(z, s). 
(Thus v(.,-;5) = 0 fors <0.) Then u(z,t) = fos v(z, t—s; s) ds satisfies 


O?u—Au=w, u(.,t)=0 fort <0. 
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Proof: Simply repeat the proof of Theorem (5.25). 1 


We can rewrite the solution u in Theorem (5.26) as follows. By the 
discussion at the end of §5B, the function v in Theorem (5.26) is given by 


u(-,t;s) = w(-,s) #2 Be, 


where ®, is given by (5.19) and (5.20), (5.22), or (5.23) depending on n, 


and +, denotes convolution with respect to the space variables. But then 


u(-,t) = ie w(-, 5) *, By_, ds, 


which is a convolution integral in ¢ if we replace ®, by 0 for s < 0. In other 
words, if we define the distibution 6, on R” x R by 


0 ift <0, 
that is, 
6.27) Gav = [eu vN)d (ECPM xR), 
we have 


u=we dy (convolution on R® x R). 


This holds in particular for any w € C?°(R” x R), which means that ®, is 
a fundamental solution for the wave operator. In short, we have: 


(5.28) Theorem. 

Define the distribution ®, on IR" by (5.19) and (5.20) ifn is odd and n > 3, 
by (5.22) ifn is even, and by (5.23) ifn = 1, and define the distribution 
@, on R" x R by (5.27). Then 4 is a fundamental solution for the wave 
operator. 


The solution of the inhomogeneous equation in Theorem (5.25) is not 
given by a convolution integral, since the imposition of the conditions at 
t = 0 breaks the translation invariance in ¢. We shall investigate 6, further 
in the next section. 
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EXERCISES 


1. Let u be the solution of 0?u— Au = w given by Theorem (5.25). For 
each (xo,t9), determine the region of R® x R in which the values of w 


influence the value of u at (zo,¢o). Do the same for the solution u in 
Theorem (5.26). 


2. Let LD be a constant-coefficient differential operator on R”®. Suppose 
{Fi}i>0 is a family of distributions depending smoothly on ¢ such that 


where 6 denotes the point mass at the origin. 
a. Show that if g € Co°(IR"), u(z,t) = g * Fi(z) solves the Cauchy 
problem 


0 (i<m-1), 


dPus=Lu (t>0), aiu(e,0)= {0 (j =m-—1). 


b. Define the distribution Fy on R” x R by 


(Fed) = f° (Fe, 0.0) dt 


Show that Fy, is a fundamental solution for 07 ~— L. (Note that 
the fundamental solutions of the heat and wave equations given by 
(4.5) and (5.27) are of this form.) 


D. Fourier Analysis of the Wave Operator 


We now re-examine the problems solved in the previous two sections by 
using the Fourier transform. To begin with, we consider the Cauchy prob- 
lem (5.4) for the homogeneous wave equation. Application of the Fourier 
transform in the space variables converts this into 


PG+4n7EPG=0, A(E,0)=F(E), — HUE, t) = GLE), 


where t(-,t) denotes is the Fourier transform of u(-,t) for each t. The 
general solution of this ordinary differential equation is a linear combination 
of cos 2m|é|t and sin 2z|é|t; taking the initial conditions into account, we 
see that 


sin 27|é|t 


(5.29) G(E,t) = f(€) cos 2mle|t + ()— Talk| 
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Therefore, 
u(.,t)=f*eWitg +h, 


where W and @ are the inverse Fourier transforms of the functions 
VE) = cos2rlé|t = 81 (€). 


Direct evaluation of these inverse Fourier transforms is not easy, because 
although , and 4, are perfectly nice tempered distributions, they are not 
L} functions. Of course, we already know the answer: if we compare (5.29) 
with (5.21), we see that ; is the distribution defined by (5.19) and (5.20) 
when n is odd and > 3, by (5.22) when n is even, and by (5.23) when 
n= 1; and W, = 0:4,. When n = 1, the direct calculation of 5, from ®; is 
a triviality: 


en Brit ay elit 


t 
&.(¢) = 1 -2ni€z gy — 
BAe) = 5 is fs = —4Ani€ 
_ sin2n€t — sin 2n|é|t 


Qné Amiel 


The corresponding calculation for n = 3, where &; is a scalar multiple of 
surface measure on Sj,)(0), is also quite easy; see Exercise 1. For n # 1,3, 
however, going from ®; to &, is rather arduous. 

It is, however, possible to apply the Fourier inversion formula to &, ina 
way that readily displays some of the most important qualitative features of 
$,, although not the exact nature of its singularities on the sphere 5j,)(0). 
Namely, we consider 


Berg) — 28S _ one Sin 27|é|t 
BE) = ew IGG) = erPRUITETEE (> 0). 


Clearly 34 _ o, uniformly as ¢€ — 0, so that ©; will be the limit in the 
topology of tempered distributions of ®f, the inverse Fourier transform 
of &. Moreover, os € L}(IR"), so we can calculate its inverse Fourier 
transform as an ordinary integral. In fact, we have 


en 2sl€l ds, 


a e-2r(e-itl€] _ p-2e(esitlel petit 
&(€) = ——__T, | 
€ 


anilé| ee re par 
so 


€ 1 a 2nté-c—2wsl€l 
Oi(x) = ay e e dé ds. 


The Wave Operator 179 


Now, the inner integral is the inverse Fourier transform of e~?**!|, which 
we calculated in Theorem (4.15): 


Bes T'((n + 1)/2) s 
Qrit-xc —2wrslél — 
a £ € dg = enti? (524 [2jern/e 


Actually, Theorem (4.15) pertains to the case s > 0, but this formula 
remains valid for all complex s with Res > 0 provided that (s? + |x|?)!/? 
is taken to be the square root of s? +|z{? with positive real part. (One can 
either repeat the proof of Theorem (4.15) with s complex, or argue that 
both sides of the formula are analytic functions in the half-plane Res > 0 
that agree for s > 0 and hence everywhere.) Therefore, if n > 1, 
(5.30) 
pe ODA a 

t Dia (n+1)/2 parr (s2 + |x|2)@+0/2 

T'((n +1)/2) 1 1 

~ 21 — n)x(+)/2 Fern: + [x[2]@-D/2 © [(e— it)? + aE ; 


ds 


(The corresponding calculation for n = 1 is left to the reader; see Exercise 
2.) 

Now we let ¢€ — 0. Clearly ©; — 0 uniformly on compact subsets of 
{z : |z| > |t|}, so we recover the fact that supp @ C {x : |z| < |t|}. This in 
turn is equivalent to the facts about the localized dependence of the solution 
of the Cauchy problem on the initial data that we originally derived from 
Theorem (5.3). Moreover, if n is odd (and > 1), so that (n — 1)/2 is an 
integer, bf — 0 uniformly on compact subsets of {zx : |x] < |t|} also. Hence 
supp ® C {z: [z| = |t]}, which is a restatement of the Huygens principle. 
However, for |x| < |t| the quantities (e + it)? + |z|? approach the negative 
real axis (the branch cut for the square root) from opposite sides, so if n 
is even the two terms on the right of (5.30) do not cancel out but add up, 
with the result. that ®,; agrees on the region {z : |x| < |t|} with the function 


(—1)*/2)- 1 T((n + 1)/2) sgnt 1 
(n— Ir4)/2 (t? — |x |2)(--1)/2° 


On the other hand, (5.22) implies that ®, agrees on this region with the 
function 


2sgnt 1g \(n/2)-1/42 2-1/2 
— (8 t x 
Lae tea ‘) ( | ") ’ 


and it is an elementary exercise to see that these two functions are the 
same. 


180 Chapter 5 


One of the main advantages of the Fourier representation (5.29) is that 
it yields easy answers to questions about L? norms. For example, it is 
obvious from (5.29) and the Plancherel theorem that if f,g € L?(IR") then 
u(-,t) € L?(IR") and ||u(-,¢)}|2 is bounded independently of t, so that u € 
L?(R® x [to, t:]) whenever —oo < tg < t1 < 00. (See also Exercise 3.) This 
result can be refined to take account of smoothness conditions. To do so, 
we shall have to get ahead of our story a little and introduce an idea that 
will be explained more fully in Chapter 6. 

If k is a positive integer and Q is an open set in R” (or R" xR), we denote 
by H;(9) the space of all f € L?(Q) whose distribution derivatives 0% f 
also belong to L?(Q) for ja] < k. If 2 = R", by the formula (0° fy(é) = 
(2mig)? F(E) and the Plancherel theorem we see that f € H,(IR") if and 
only if €°f € L? for |a| < k, or equivalently, (1 + |é|)*f € L?. 

Now, from (5.29) we have 


(83 8} uy (E, t) = (2mi€)% (ele l)? FE) trig 2x|élt 

+ (2mi€)* (Qa |E |) —*G(E) trig Qe lt, 
where trig denotes one of the functions + cos or tsin. From this it is clear 
that if f € H,(IR") and g € Hy_1(IR") then 0207 u(-,t) belongs to L?(R”) 
with L? norm bounded independent of ¢ for ja|+j < k. An integration 
over a finite t-interval then yields the following result, which shows that, 
unlike the situation with continuous derivatives, solving the wave equation 
preserves L? derivatives. 


(5.31) Theorem. 
Let u be the solution of the Cauchy problem 


aPu — Au = 0, u(x, 0) = f(z), O,u(x,0) = g(z). 


If f € A,(R") and g € Hy-1(IR") then u € H,(IR® x [to, t1)) whenever 
—o0o < ty < ty < 0. 


There are also various boundedness theorems for solutions of the wave 
equation in terms of Z? norms with p # 2, but these are mostly quite 
recent and depend on some deep results of Fourier analysis. In particular, 
Peral [39] has proved an L? analogue of Theorem (b, me (but with a small 
loss of smoothness, depending on p) for \5 - i/< < s27- See also Stein [46, 
§VIII.5], and the references given there. 

We now turn to the question of finding a fundamental solution for the 
wave equation by Fourier analysis. Here we wish to solve the equation 


OP u(x,t) — Au(z,t) = 6(z,t) = 5(2)6(t). 
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Application of the Fourier transform in z turns this into 
dPU(E, t) + 4m? |€|7a(E, t) = 6(2). 


The recipe for solving this ordinary differential equation is as follows. a 
must be a solution of the homogeneous equation 07% + 417 |€|?a@ = 0 for 
t <0 and for ¢t > 0, so 


eo __ f a(€) cos 2n|élé + O(€) sin 27|élé (t < 0), 
ue 0 { c(€) cos 2m|élé + d(é) sin 2x|E]é (t > 0). 


To obtain the delta function at ¢ = 0 on applying 0? to @, we require that 
u should be continuous at ¢ = 0 but that 0,u should have a jump of size 1 
att=0: 


a(€)=c(é), (2m |)[4(€) — 6(€)] = 1. 


This gives two equations for the four unknowns a,b,c,d; the remaining 
degrees of freedom can be used to satisfy side conditions. If we think if 
u(x,t) as the response of a system at rest to a sudden jolt at r=0,¢ =0, 
it is reasonable to require u(z,t) = 0 for t < 0. Hence, we take a = b = 0, 
which yields c = 0 and d = (2z|€|)~!. In short, 


(5.32) a(é,t) = 4 rk sin 2n|é|t = ,(€) Hany 


so that u is the fundamental solution ®, given by Theorem (5.28). 

It is of interest to compute the full Fourier transform of @, in both z 
and t. We expect to get something like [4?(|€|? — r?)]~}, but the latter 
function is not locally integrable near points of the light cone and so will 
need to be interpreted suitably as a temepered distribution. In fact, we can 
compute the Fourier transform in ¢ of (5.32) by the same device that we 


used to compute the inverse Fourier transform in z of ®;. Namely, consider 

s eae e2rit(leltic) _ -2eit(—leltie) 

6 (€,t) =e7 @,(€,t) = t - 
+é ) e +E ) X{0,00)( ) 4nile| 


The t-Fourier transform of this is 


ie oo _2mit(lé|—r+ie) _ e2nrit(—lél—-r+#e) 
en 28TH BE (Et a= | lt 
/ HG edt= f ariel 


_- i lan Gn Ee 

~ gre] [lel—r+ie  lel-4+r—te 
1 

~ allel? — (7 — te?) 
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Clearly od — &, uniformly as € > 0, so the full Fourier transform of 
@, is the limit in the topology of tempered distributions of the functions 
((4n?)(\el? — (7 = e)2))2. 

The fundamental solution ®; has the property that the solution of u = 
ws, of the inhomogeneous equation 0?u—Au = w at timet = tp depends 
only on the values of w at times t < to, and it is uniquely determined by 
this property. It is therefore the natural fundamental solution to use when 
u is considered as the response of a system to an driving force w. However, 
there are other fundamental solutions for the wave operator that are of 
significance in physics. One is the time reversal of ®,, 


_(z,t) = 4 (z, —t), 


whose full Fourier transform is the distribution hmit of the functions 


1 


OG FPS FIT 


(4 and ®_ are often called the retarded and advanced Green’s func- 
tions in the physics literature.) Another is the causal Green’s function 
or Feynman propagator ®,, the distribution whose full Fourier trans- 
form is the distribution limit of the functions 


1 


FREE, T) = ane? — 7? — ie) 


as ¢ — 0+. The reader may find an explanation of the role of these Green’s 
functions in quantum field theory in Bogoliubov and Shirkov [8]; here we 
merely wish to point out that there are a number of natural but quite 
different fundamental solutions of the wave equation. (In other words, 
there are a number of natural but quite different distributions that agree 
with the function [47?(|€|? — r?)]~! away from the light cone!) 

We refer the reader to Treves [52] for further discussion of the properties 
of b,. 


EXERCISES 


1. When n = 3, the distribution ®, defined by (4.19) and (4.20) is 
(4n|t|)~1o,4, where o, is surface measure on S,(0). Show by a di- 
rect. calculation that ,(€) = (2m|é|)! sin 2alé|t. (Hint: Since &, is 
radial, so is ,; hence it suffices to consider € = (0,0, p) with p > 0. 
Integrate in spherical coordinates.) 
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2. Perform the calculation (5.30) when n = 1 to obtain the formula (5.22) 
for @, in the one-dimensional case. 


3. Use the Fourier representation (5.29) to rederive the result of Exercise 
2, §5A. That is, suppose u, f, and g are related by (5.29) and that 
f € H,(R") and g € L7(R"). Show that for allt € R, 


[venue nPae =f \Veru(2,0)Paz = [VSP + lale)P) de. 


E. The Wave Equation in Bounded Domains 


When solving the wave equation in the region 2 x (0,00), where Q is a 
bounded domain in R", it is appropriate to specify not only the Cauchy 
data on 2 x {0} but also some condition on 82 x (0,00) to tell the wave 
what to do when it hits the boundary. The most commonly used conditions 
are those of Dirichlet and Neumann, ie., u = 0 or O,u = 0. For either of 
these the solution is unique, as may be proved by the same method as 
Theorem (5.3); see Exercise 1. 

If the boundary condition on 8Qx(0, oo) is independent of ¢, the method 
of separation of variables can be used, just as for the heat equation (§4C). 
That is, we can look for solutions u in the form u(z,t) = F(z)G(t) where f 
satisfies the boundary condition on OQ. The wave equation 0?u — Au = 0 
is equivalent to the equation (AF)/F = G"/G, and since (AF )/F depends 
only on x and G’’/G depends only on t, these quantities must be equal to 
some constant which we write as —\? in the hope that it will be real and 
negative. If we can solve the equation AF + \?F = 0 on 2 for some A > 0 
subject. to the given boundary condition on 0Q, we obtain solutions of the 
wave equation 

u(z,t) = F(z)(acos At + bsin At) 


where a and 6b are constants that can be adjusted to satisfy initial condi- 
tions. This represents a normal mode of vibration with frequency 4, and 
with luck we will be able to obtain the general solution as a superposition 
of normal modes. 

In particular, consider the Dirichlet boundary condition u(z,t) = 0 for 
z € 8Q, and assume that OQ is smooth. We shall see in §7G that L?(Q) 
admits an orthonormal basis {F;} consisting of eigenfunctions for A on Q 
with negative eigenvalues {—\j} and satisfying F; = 0 on 8Q. Hence we 


184 Chapter 5 


can solve the problem 
O?7u — Au = 0 on 2 x (0,00), 
u(z,0)= f(z),  Au(z,0)=9(z), uz, t) = 0 for z € ON, 


by taking 
u(z,t) = ys F;(x)(a; cos A;t + b; sin A;t) 


Sah=f DAR =o, 


a; = (FI Fj), bs = AF*(g LF). 
If Q is a ball, we have already seen how to do this in §2H. 
For n = 1 and 2 = (0,]), the problem is 
Gu — Au = 0 on (0,1) x (0, 00), 
u(z,0)= f(z), Au(z,0)= g(x), u(0,t) = ufl,t) = 0. 
The normalized eigenfunctions are Fj(z) = \/2/I sin(jaz/l) and the asso- 
ciated frequencies \; are the integer multiples of the fundamental frequency 


m/l, and we obtain the familiar expansion in “harmonics” of the vibration 
of a string fixed at both ends: 


where 


that is, 


(5.33) 


40 grt juz 
u(z,t)= Des [; cos? cos — ; ¢ b; sin *] sin —— T 
(5.34) 


t . i . 
aj = if f(z)sin 2 dz, bj = = | g(z) sin 2 de. 


For n = 2 and Q = Br(0), corresponding to a vibrating circular mem- 
brane fixed at the boundary, the numbers Aj; are zeros of Bessel functions. 
These numbers are not rational multiples of each other and so do not “har- 
monize” particularly well. For this reason a violin has better tone quality 
than a drum. 

For more about vibrations and eigenvalue problems, see, e.g., Courant 
and Hilbert [10, vol. I] or Folland [17]. 


EXERCISES 


1. Suppose 2 is a smoothly bounded domain in R", and suppose uw is a 
C? function satisfying 0?u — Au = 0 in Q x (0,00) and either u = 0 
or Oyu = 0 on A x (0,00). Adapt the proof of Theorem (5.3) to 
show that E(t) = f, |Vz,ul? dz is independent of t. Conclude that if 
u(z,0) = Gu(z,0) = 0 then u=0. 
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2. Suppose f and g are continuous functions on (0, /] that vanish at both 
endpoints. Let f and g be the functions on R obtained by extending 
f and g to be odd functions on [—I,/] and then extending them to be 
periodic functions on R with period 2I, and let u be the solution of 


a7u—Au=0, u(z,0)= F(z), Gu(z, 0) = 9(z) 


on R x R given by (5.5). Show that the restriction of u to (0, /] x [0, 00) 
is the solution of (5.33). (This form of the solution tells you what you 
see when you look at a vibrating string, but (5.34) tells you what you 
hear when you listen to it.) 

3. Solve the wave equation 02u — Au = 0 on (0,1) x (0,00) with initial 
conditions u(z,0) = f(r), d:u(x,0) = g(r), subject to the boundary 
conditions u(0,t) = O,u(l,t) = 0. In particular, show that the fre- 
quencies are the odd integer multiples of the fundamental frequency 
n/2l. (This problem models vibrations of air in a cylindrical pipe that 
is open at one end and closed at the other, like a clarinet; u represents 
the change in air pressure relative to the ambient pressure. On the 
other hand, (5.33) models vibrations of air in a cylindrical pipe that is 
open at both ends, like a flute.) 


4. Let Q = {x € R®: |x| < I}. Use separation of variables to solve the 
wave equation for radial functions in 2 x (0,00) with Dirichlet boundary 
conditions. That is, by Proposition (2.2), the problem is to solve 


O? u(r, t) — O2 u(r, t) — 2r7'0, u(r, t) = 0 forO< r<l, t>0, 
u(r,0)= f(r), Qu(r,0)= g(r), u(l,t)=0. 


(Hint: The eigenvalue problem to be solved is F"(r) + 2r71F'(r) + 
?F(r) = 0, with F(l) = 0 and F(0) finite. Reduce this to a more 
familiar problem by setting F(r) = r~!G(r).) In particular, show that 
the frequencies are the integer multiples of the fundamental frequency 
m/l. (The restriction of u to a narrow conical region, say {z : ¢(z) < 
€} where ¢(x) is the angle from z to the north pole, is a model for 
vibrations of air in a conical pipe like an oboe, bassoon, or saxophone; 
cf. Exercise 3.) 


F. The Radon Transform 


To conclude this chapter we present an elegant method for solving the 
Cauchy problem (5.4) based on the “Radon transform.” We begin by 
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observing that if the Cauchy data depend only on z; and not on z2,...,2n, 
then the solution u will also be independent of z2,...,2,, and will be 
expressed by the one-dimensional formula (5.5): 


v1 


+t 
u(z,t) = $[f(2i +t) + f(zi -t] +4 g(s) ds. 


xy -t 


(This is clearly a solution; by uniqueness, it is the solution.) By performing 
a rotation in R", we see more generally that if f and g are constant on 
all hyperplanes with a fixed normal vector w, say f(x) = F(c-w) and 
g(x) = G(x -w), then u will have the same property and will be given by 


few+t 


u(z,t) = HF(e-w+t)+ Flew -H)+$ [ G(s) ds. 


Lett 


In this case u is called a plane wave in the direction w. 

The idea is to decompose “arbitrary” functions f and g into sums (or 
integrals) of functions depending only on z-w as w varies over the unit 
sphere, and then to express u as the corresponding sum of plane waves. In 
order for the integrals that arise to be classically convergent, it is necessary 
to assume that f and g have some degree of smoothness and that they 
decay reasonably fast at infinity. We shall avoid all technical complications 
by assuming that f and g are in the Schwartz class 8. 

Let us denote by S,, the unit sphere in R*: 


S, = {2 € R*: |z| = 1}. 


If f € S(R*), its Radon transform Ff is the function on R x S, defined 
by 
Re(su)= f _ s2)de, 
rw=os 

where dz denotes (n — 1)-dimensional Lebesgue measure on the hyperplane 
x-w=s. Clearly Rf € C™(R x S,) and Rf(-,w) € S(R) for each w € Sy. 
Moreover, Rf(s,w) = Rf(—s, —w). 

Ris closely related to the Fourier transform. In what follows, we shall 
denote by Rf the Fourier transform of Rf with respect to its first variable. 


(5.35) Proposition. a, = 
If f €8, p€ R,andw € Sy, then Rf(p,w) = f(pw). 
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Proof: We have 


F (pw) wot ere * f(z) dvix ay Bas e7 2405 F(z) dzds 


= i e728 RF(s,w) ds = Rf(p,w). i] 
R 
(5.36) Corollary. 
Iffes, 
Rf(s,w) = [etF ow) dp. 
Proof: Apply the Fourier inversion theorem to Rf : ] 


Proposition (5.35), together with the Fourier inversion theorem and the 
formula for integration in polar coordinates, yields an inversion formula for 
the Radon transform. Indeed, 


g(ay= fr Fleyae = ff” ctr Fp)p"™ dpda(w) 
=f [0 are Rieu" dpdotw) 
nO 


= A(x -w, w)do(w), 
Sn 
where 
oo . — 
h(s,w) =/ e?™* RF (p,w)p"—* dp. 
Ct) 

But since A(x -w, w) is integrated over the entire unit sphere, the result is 
the same if we take only its even part in w: 


(5.37) f(z) = if [A(z -w, w) + h(—z-w, —w)] do(w). 


Now clearly R}(p,w) = RF(-p, —w) by Proposition (5.35), so 
3[h(s,w) + h(—s, -w)] 
oo co 
= +/ cP? Rf(p,w)p"—" dp + if eW?t* RF(p, wp") dp 


00 0 
= +f One (a, we" dp § [ ec?" Rf(p,w)(—p)"—* dp 
0 


—0oO 


oo 
= [7 emo RF( 6,0) dp 
-00 
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We call this function the modified Radon transform of f and denote it 
by Rf(s,w). 7 
We can express Rf more directly in terms of Rf. If n is odd, then 
\p|"—1 = p"—1, and we have 
R (-1)@e—D? is 2nxips Qni n-1 py 
f(s,w) = Soper fs (2zip) F(p,w) dp 
= (-1)ir7D/2 n—1 
= Tampa OP-* Rf(s,w). 
On the other hand, if n is even, 
(-1)-2)/? i 2xips : - n=l pF 
Mampat Jo” (—isgn p)(2mip)”~* Rf(p,w) dp 
(—1)(0-?)/2 
~  2(2r)n-1 
where H is the Hilbert transform, defined for ¢ € S(IR) by 


(HY (p) = (—isgn p)4(p), 


Rf(s,w) = 


HOP Rf (sw), 


or equivalently by 


1 ¢(s —t) 
H = lim — —~—_— dt. 
4(s) eno T |t|>« t 


(For the equivalence of these two formulas, see Exercise 1.) In any event, 
(5.37) becomes 


He) = f Rf(a2-w, w) do(w), 


which is the inversion formula for the Radon transform. 
Now we can solve the Cauchy problem (5.4) for f,g € 5. Namely, we 
write 


(2) = [ Ryle-w,w)do(w), (2) = ff Role -w, w) dow), 


expressing f and g as superpositions of functions that depend only on 
z-w for various values of w. Then the solution u is the corresponding 
superposition of plane waves: 


u(x,t) = if [Re +t,w)+Rf(2-w —t, w) 
(5.38) ie 


“6 [ ; < Fig(s,w) as do(w). 


‘we 
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(See also Exercise 2.) 

In this setup, the Huygens phenomenon arises from the difference in 
the formulas for R when n is even or odd. It is related to the fact that Os 
is a local operator (that is, 8,¢(s0) depends only on the values of ¢ near 
So), whereas the Hilbert transform H is not. 

For more about the Radon transform and its applications to differential 
equations, see John [29] and Ludwig [35]. See also Deans [11] for a discus- 
sion of some of the many other applications of the Radon transform in 
astronomy, medicine, and other areas, and Helgason [24] for a development 
of the Radon transform in a more general geometric setting. 


EXERCISES 


1. Show that the inverse Fourier transform of the function h(p) = —isgnp 
on R is the distribution hY defined by 


(hY, pb) = lim ae VCS) as. 


€0 1 & 


Hint: First show that the inverse Fourier transform of e~?*¢l/lh(p) is 
s/1(s? + €?), then apply Theorem (0.13) with 


8 ___X(~00,—1)(5) + X(1,00)(5) 


ols) = a+) 7 


2. Show that R(0;f)(s,w) = wj0,Rf(s,w), and hence that R(Af) = 
O?Rf. Application of the Radon transform therefore reduces the n- 
dimensional wave equation to the one-dimensional wave equation; use 
this fact to derive (5.38). 


3. Show that R(f *g) = Rf +, Rg, where *, denotes the convolution of 
two functions of (s,w) with respect to s. 


4. Compute Rf where f(z) = ew 7”, 


Chapter 6 
THE L? THEORY OF DERIVATIVES 


One of the most elegant and useful ways of measuring differentiability prop- 
erties of functions on R® is in terms of L? norms. The reason for this is 
twofold: first, L? is a Hilbert space; second, the Fourier transform, which 
converts differentiation into multiplication by polynomials, is a unitary iso- 
morphism on L?. The resulting function spaces are known as (L?) Sobolev 
spaces. 

In the first two sections of this chapter we develop the basic properties 
of Sobolev spaces on R”. In the next two sections we use them to prove 
the local regularity theorem for elliptic operators (a result which we shall 
re-derive, in a refined form and with more sophisticated methods, in Chap- 
ter 8) and Hormander’s characterization of constant-coefficient hypoelliptic 
operators. In the final section we study Sobolev spaces on bounded do- 
mains, where the Fourier transform is not directly available but to which 
the results on R® can be applied. 


A. Sobolev Spaces on R” 
To begin with, if k is a nonnegative integer, we define the Sobolev space 
Hy, = H;,(R") of order k to be the set of all f € L?(IR") whose (distribu- 
tion) derivatives 0% f belong to L?(IR”) for |a| < k: 

Hy={f €L?:8°f € L? for lal < k}. 
This definition of H, makes its meaning clear, but there is an equivalent 


characterization of H; in terms of the Fourier transform that is easier to 
work with: 
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(6.1) Theorem. 7 
f © Hy if and only if (1 + |€|?)*/?f € L?, and the norms 


6-[¥ No"sB>| ery [ / Ferra + tery" ae] . 


lal<e 


are equivalent. 


Proof: Since (O° fJ() = (2mi¢)* f(€), the Plancherel theorem im- 


plies that 

D laevis = Yo /FerPieneye ae, 

lalsk lal<k 
so the theorem amounts to proving that the quantities Dial<k \€7|? and 
(1-+|€|?)* are comparable, i.e., that each is bounded by a constant multiple 
of the other. But clearly |€¢| < 1 for |é| < 1, and |€%| < |é|lel < |é|* for 
lel > 1 and [al < k, 80 


SS le*l? S$ Ci max(1, le)*) < (1+ [€)?)*. 

Jaise 
On the other hand, since |€|?* and $7} |€}|? are both homogeneous of degree 
k, we have |€|?* < Cz 57} |é#|? where C2 is the reciprocal of the minimum 
value of S77 |€}|? on the unit sphere |€| = 1. It follows that 


(1+ [él?)* < 2* max(1, |é1*) < 2*(1 + lé[?*) 


<20a]1+ IGF] <tc DD ler, 
1 


lol<k 


which completes the proof. | 


This suggests a generalization of H, in which k is replaced by an ar- 
bitrary real number s. Namely, if u(é) is a function on R” such that 
(1 + |é|?)*/2u(é) € L?, then ud € L! for any ¢ € §, so u is a tempered 
distribution (whose action on ¢ € 8 is f u@). Since the Fourier transform 
maps tempered distributions into tempered distributions, we can define the 
Sobolev space of order s: 


H, = H,(R")= {! € 8'(IR") : f is a function and 
(6.2) - 
Wie = f Fea + lePy' ae < oo} 
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The norm |]: {], on H,; thus defined is called the Sobolev norm of order 
s. Theorem (6.1) shows that this definition agrees with the previous one 
when s is a nonnegative integer. In particular, Ho = L?, and the norm 
on L? will henceforth be denoted by || - |lo. (There will be no notational 
confusion between the Sobolev norms ||- ||, and the L? norms, since we 
shall rarely use the latter for p # 2, and when we do, we shall henceforth 
denote them by || - [|z-) 

The argument that proves Theorem (6.1) also proves the following gen- 
eralization if it; one has merely to introduce factors of (1+ |€|?)* in appro- 
priate places: 


(6.3) Theorem. 

Suppose k is a positive integer, s is a real number, and f is a tempered 
distribution. Then f € H, if and only if O° f € H,_,% for |a| < k, and the 
norms 


1/2 
Iifile and iS lors] 


lal<k 


are equivalent. In particular, 8° is a bounded operator from H, to Hs_% 
for all s when |a| < k. 


H, isa Hilbert space with inner product 


(Fla). = / FOTO + le?" ae, 


and the Fourier transform is a unitary isomorphism from H, to L?(IR®, 1) 
where dyu(€) = (1+ |€|?)* d€. Standard approximation arguments show that 
S (or even C2°) is dense in the latter space, and since the Fourier transform 
maps § onto itself, 5 is dense in H,. 

If s <t, it is clear that H; C H, and that ||-||s < ||- lle, so the inclusion 
map from H;, to H, is continuous. In particular, H, C Ho = L? for s > 0, 
so the elements of H; for s > 0 are functions. For s < 0 they are, in 
general, more singular objects. 


Example 1. 

With n = 1, let f(z) = (wz)~!sin2az. An elementary calculation shows 
that f is the inverse Fourier transform of x{~1,1], 80 f € Hs for all s. It 
is also easy to verify directly, by induction, that f() decays like x7! at as 
az — 00 and hence that f(*) € L? for all k. (This example shows, however, 
that elements of Sobolev spaces need not decay very rapidly at infinity.) 
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Example 2. 
Let 6 be the point mass at the origin. Since 6 = 1 we have 


oO 
Note = f+ le) dean [re ar, 
fy 
which is finite if and only if s < —4n since the integrand is roughly r7*+"~1 
for large r. Hence 6 € H, if and only if s < —4n. 


Since H, is a Hilbert space, it is naturally isomorphic to its own dual 
space — or rather anti-isomorphic, since the identification is conjugate- 
linear. However, there is an equally natural anti-isomorphism between 
(H,)* and H_,, given not by the inner product (-|-), but by the ordinary 
L? inner product on the Fourier transform side. Indeed, if f € H, and 
g € H_,, then 


FF=(A4NEPYPAIA + EP) EL? - cL, 
so (F | 9) = SFT is absolutely convergent, and f > (FI9) defines a bounded 


linear functional on H,. Indeed, 


2 


. - 1/2 / 
Wlals [fier tery ae] | fera +tery* ae| 
= lillal-s 


with equality if Fe) = (1+ |€|?)~*g9(€), and the self-duality of Z? implies 
that every bounded linear functional on H, is of this form. In short, the 
map taking g € H_, to the functional f - (Fl g) defines an isometric anti- 
isomorphism between H_, and (H;)*. Moreover, if we restrict the latter 
functional to $, we obtain a tempered distribution which is nothing but 9: 


(615) = (6,3) = (9,9) = (4,9). 


Thus, the duality between H, and H_, is, except for the necessary intro- 
duction of a complex conjugate, compatible with the duality between § and 
S!. 

We can express the norm on H, neatly in terms of the operator 


3/2 


At = [I -(2x)774] (s ER). 


Here we are using the functional calculus for the Laplacian discussed in 


§4B, that is, 
(6.4) (At Fy (€) = (1+ le?) F(E). 


194 Chapter 6 
Clearly, by the Plancherel theorem, 
Flls = (JA? fllo = A’ File2, = (Fig). = (APF A%Q), 
and A‘ is a unitary isomorphism from H; to H:-; for allt ER. 
Although Sobolev spaces make the manipulation of distribution deriva- 
tives very easy, they would be of limited usefulness if we could not relate L? 


derivatives to classical pointwise derivatives. Fortunately, there is a simple 
and beautiful connection between the two. 


(6.5) The Sobolev Lemma. 
Ifs>k+ in, then H, C C* and there is a constant C = Cy,z such that 


(6.6) sup sup |d*f(z)| < Cllfll,. 


Jal<k 2eR™ 
Proof: By the Fourier inversion theorem, if (0° f° € L} then 0° f is 
continuous and sup, |8% f(x)| < ||(O% fY|]z1. Hence, to prove the theorem it 


is enough to prove that ||(@°f)||z: < |Ifll, when |a| < k. But (8° fY(€) = 
(Qwié)* f(€), and by the Schwarz inequality, for |a| < k we have 


[iene Fela <anyt [0+ IFO 
= (2myt f+ ley 1F@)IG + EPO“! ag 
~ 1/2 
< (ont | fa sirrimeras fa+irytacl 


The first integral on the right is ||f{|?, so the theorem boils down to the 
fact that 


eo 
fo + ié?)*-* dé = vn f (+ p?yk-s_n—l dr < 00 
) 
precisely when s > k + gn. (Cf. Example 2 above.) i] 


(6.7) Corollary. 
If f € H, for alls ER, then f EC. 


(6.8) Corollary. 
Every distribution with compact support belongs to H, for some s E R. 
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Proof: If f is a distribution with compact support, it follows from 
(0.32) and the Sobolev lemma that for some positive integer NV, 


KF, d)1<C D> sup ld%d(z) <C'lidll, (6 € 8), 


lalgn * 


where s = N+ in+ 1. But this means that f extends to a bounded linear 
functional on H, and hence belongs to H_,. I 


Remark 1: The proof of the Sobolev lemma shows that H, C BC* 
fors > k+ in, where BC* is the space of C* functions f such that 
O° f is bounded for |a] < k. Conversely, if H, C BC*, the closed graph 
theorem easily implies that the estimate (6.6) holds. But this means that 
the functionals f — O° f(x) (la] < k, c € R") are a bounded set of bounded 
linear functionals on H,, or in other words, that the distributions 0°5(-~ 2) 
are a bounded subset of H_,. A calculation like the one in Example 1 above 
shows that this happens precisely when s > k + in. (See Exercise 1.) 


Remark 2: The Sobolev lemma can be sharpened a bit: if s = k + 
a+ 4n where 0 <a <1, then H, C C*t*. See Exercise 2. 


A few examples may help to elucidate the meaning of the Sobolev 
lemma. 


Example 3. 

Pick ¢ € C& with ¢ = 1 near the origin, and let f,(z) = |z|*¢(x) where A 
is positive and not an even integer. Since 0% f, is homogeneous of degree 
A — ja] near the origin, an integration in polar coordinates shows that 
O° f € L? for |a| < A+ fn, but O°f is continuous only for ja] < ». (See 
Exercise 3.) 


Example 4. 

Let f(x) = e~?"!*l, By Theorem (4.15) and the Fourier inversion theorem, 
F(€) is a constant times (1 + [€|?)"@+/2, It follows easily that f € H, 
for s < in +1 but f ¢ H(nj2)41- The Sobolev lemma says that f should 
be continuous (which it is), but just barely fails to guarantee that f € C! 
(which it is not). 


Example 5. 
Suppose f € C* has compact support (so f € Hy also), and let u be the 
solution of the Cauchy problem 0?u—Au = 0, u(x,0) = f(x), u(r, 0) = 0. 
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The argument leading to Theorem (5.31) shows that u(-,¢) € Hx for all t, 
but (5.16) and (5.18) show that u(.,t) may be no better than C#-[r/?I, 
where [n/2] is the greatest integer in n/2. This is essentially the maximum 
discrepancy between L? derivatives and continuous derivatives allowed by 
the Sobolev lemma. 


Elements of L? are only defined almost everywhere, and in general it 
does not make sense to evaluate them at a single point. The Sobolev lemma, 
however, shows that if s > $n, the evaluation map f — f(z) (z € R") has 
a natural meaning for f € H,. More generally, it turns out that ifk <n 
and s > $k, it makes sense to restrict functions in H, to submanifolds 
of codimension k, even though these sets have measure zero. For the case 
k = 1 in particular, this is useful for the study of boundary value problems. 
We shall prove this for the special case of linear subspaces here; the general 
case will then follow from some results of the next section. 

To be precise, let us regard R" as R"~* x R* with coordinates y € 
IR"-*, z € R* and dual coordinates 7,¢. We define the restriction map 
R: §(R") — S(R"~*) by 


Rf(y) = Fly, 0). 
(6.9) Theorem. 
Ifs > 5k, the restriction map R extends to a bounded map from H,(R") 


to Hy_(x/2)(R"~*). 


Proof: It suffices to show that ||Rf||.-(¢/2) is dominated by ||f||. for 
f € 8. We have 


fen Rtm dn = Rf(y) = f(y, 0) = // et" Fn, C) dn dé 
for all y, so Rf(n) = f F(n,¢) dC. Therefore, 
IRF(n)P = [/ F(n, 0) + lal? + 16?) + InP? + cP)? ac| 
< [ fiftn.cora + in? + tery’ ac] | f+ i + cP). 
Setting 1+ |n|? = a? and |¢| =r, the second factor on the right is 


oo oo 
wr | (a? tye Ndr = at My f (14 7r?)~?r*o! dr. 
0 0 
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The latter integral is finite when s > 3k, so 


IRF(m)? < C+ ny" | 1Fa.orC + In? + I¢l?)* de, 


or 
IRF(n)P?C1 + In?) -F < c, [fora + Inl? + [¢|?)* ae. 


Integrating both sides with respect to 7, we conclude that RFU? _ (2) < 
C,||f|I2 as desired. 1 


Before proceeding further, we present two simple lemmas that will be 
used a number of times in the sequel. 


(6.10) Lemma. 
For all £,n ER" and alls ER, 


( 1+ |é? 
1+ |n|? 


) < 21 + fe — yl?) 


Proof: Since |€| < | — n] + |n|, we have |é|? < 2(€ — 9]? + |n]?), so 


1+ |€|? < 201 + 1€ — nl?)(1 + |n]?). 


If s > 0, we have merely to raise both sides to the power s. If s < 0, 
we apply the latter result with € and 7 interchanged and s replaced by 
|s| = —s. i] 


(6.11) Lemma. 
Suppose r <s <t. For any € > 0 there exists C > 0 such that for all 
fe, 

ISIS < el fll? + CHS. 


Proof: Given € > 0, choose A > 0 large enough so that (1 + A)’ < 
e(1+ A)*, and set C = (1+ A”)*-". Then 


2\r ss: 
a+r s {ORY TS 4} sca +key soa +iery. 


It follows immediately that ||f||? < el] f{[? + CIF[I? for any f € He. t 
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We now show that multiplication by Schwartz functions preserves the 
H, spaces. If s is a positive integer, this is clear from Theorem (6.1) and the 
product formula for derivatives. For the general case, a different argument 
is needed. 


(6.12) Proposition. 
If ¢ € §, the map f — ¢f is bounded on H, for every sER. 


Proof: The proposition amounts to the assertion that the map f > 
A’ dA7*f is bounded on Ho = L? for every s, where A® is given by (6.4). 
But since the Fourier transform converts pointwise multiplication into con- 
volution, 


(areas) = f KEM An) dn 
where 
K(Eé,n) = (1+ (€)?)*/76(E = 0) + In f?)7°/?. 
By Lemma (6.10), 


[K(E,m)| < 21/201 + Je — ml?) 16(E — 0), 


so f |K(é,)|dé and f |K(€,n)|dq are bounded by 
ale? a+ PIB E1 ae, 


which is finite since é is rapidly decreasing at infinity. The proposition 
then follows from the generalized Young’s inequality (0.10). ] 


Proposition (6.12) shows that the Sobolev spaces are preserved by mul- 
tiplication by smooth cutoff functions, so they can be localized. To be 
precise, if s € R and 2 is an open set in R”, we define the localized 
Sobolev space H!°¢(Q) to be the set of all distributions f on 2 such that 
for every bounded open Q% with Qo C 2, f agrees with an element of H, 
on Qg. Alternatively, we have the following characterization of H!°°(Q): 


(6.13) Proposition. 
f € H'°(Q) if and only if df € H, for every ¢ € C&(Q). Moreover, 
H, C H'°°(Q) for every open QC R*. 
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Proof: Clearly the second statement follows from the first one, in 
view of Proposition (6.12). If f € H!°¢(Q) and ¢ € C%°(Q), then f agrees 
with an element of H, on the support of ¢, whence ¢f € H, by Proposition 
(6.12). Conversely, suppose df € H; for all 6 € C&(Q) and No is an open 
set with compact closure in 2. Choose ¢ € C2°(Q) with ¢ = 1 on No 
(Theorem (0.17)); then f agrees with df € H, on Qo. ] 


Roughly speaking, the condition f € H!°°(Q) means that f has the 
requisite smoothness for being in H, on Q but imposes no global square- 
integrability conditions on f. 

We conclude this section by remarking that there are L? analogues of 
the Sobolev spaces for 1 < p < o0. On the simplest level, if k is a positive 
integer one can consider the space Lf of L? functions f whose distribution 
derivatives O° f are in L? for |a| < k; this is a Banach space with norm 
Dlal<k \|O% filer. If 1 < p < oo, it can be shown (although it requires a 
fair amount of theory to do so) that f € L? if and only if A¥f € L?, where 
A* is defined by (6.4). One can then define L? for any s € R to be the set 
of tempered distributions f such that A‘f € L?, and much of the theory 
for the L? Sobolev spaces can be extended. In particular, the analogue of 
the Sobolev lemma is that L? C C* if s > k+ (n/p); one also has the 
embedding theorem L? C L} when q > p and p7! — q7! = n71(s — 12). See 
Stein [45], Adams [1], and Nirenberg [38]. 


EXERCISES 


1. Fill in the details of Remark 1 following the Sobolev lemma to show 
that if H, C BC* then s>k+ dn. 


2. Show that ifs = 4£n-+a where 0 < a < 1 then ||6, — dy||_-, < Clz—yl|* 
for all z, y € IR", where 6, and 6, are the point masses at z and y. (Hint: 
6,(€) = e7 "it Write the integral defining ||6. — dy||2, as the sum 
of integrals over the regions |] < R and || > R where R = |z — y|-!, 
and use the mean value theorem of calculus to estimate Se - by over 
the first region.) Conclude that H, C C%, and more generally that if 
s= gn+k+a with0< a <1 then H, Cc Ct. 


3. In Example 3 following the Sobolev lemma, we implicitly used the fact 
that the distribution derivatives of f,(z) = [z|*4(z) coincide with its 
pointwise derivatives when the latter are integrable functions — namely, 
when the order of the derivative is less than A+. Prove this. (One 
way is to approximate f, by smooth functions.) 
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4. Suppose s > in, Prove that H, is an algebra, and more precisely 
that ||fgl]s < Cell fllslig{|; for all f,g € Hs. Do this via the following 
lemmas: 
a. Show that (1 + |€|?)*/?(fgV(€) = f K(E,n)u(é — n)v(n) dy where 
u,v € L? and K(E,) = (1+ |E[?)*/7(1 + [é — nf?) 9/91 + |?) 87. 
b. Show that K(E,) < C,(1+|nl)~*/? if €— nl > Sel and K(E,n) < 
Ci(1 + |€ — nl?)-*/? if | — nl < FIEl. 
c. Show that if u,v € L? and w(é) = [(1 + |n|?)7?/2u(é — n)v(n) dn 
then w € L? and ||w||z2 < Cy{lullzallull|z2. 


B. Further Results on Sobolev Spaces 


In this section we present a potpourri of theorems about Sobolev spaces. 

The most important ones are the Rellich compactness theorem, a character- 

ization of H, in terms of difference quotients, an interpolation theorem, and 

the local invariance of Sobolev spaces under smooth coordinate changes. 
If 2 is an open set in R®, we define 


H°(Q) = the closure of C2°(Q) in Hs. 


Thus H°(Q) consists of elements of H, that are supported in {, although 
in general not every such element belongs to H2(Q). If {f,} is a sequence 
in Co°(Q) that converges in H;, it also converges in H; when t < s; hence 
H°(Q) is a subset. of H?2(Q) for s >t. 


(6.14) Rellich’s Theorem. 

If Q is bounded and s >t, the inclusion map H8(Q) + H}?(Q) is compact. 
In fact, every bounded sequence in H9(Q) has a subsequence that converges 
in H, for every t < s. 


Proof: Suppose {f,} is a sequence in H2(Q) with || fells < C < oo. 
Choose ¢ € C&°(R") with ¢ = lon. Then fy = oft, 80 fr = 6*fe. Also, 
since ee 8 and fe is a tempered function, o* hi isa C™ function defined 
pointwise by the usual convolution integral. Thus, by Lemma (6.10) and 
the Schwarz inequality, 


(+lePy1A(O| 
< 2/2 ‘| 8(E — nC + 16 = nl?) 1A (my + [nl2)*7? ay 
< elt lili felle- 
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For the same reason, if j = 1,...,n, 
(1 + 1€1?)°/718; fe(€)I < 21/7] 2aéx jd |hillfelle- 


Since || fell, < C, the functions f, and their first derivatives are uniformly 
bounded on compact sets, so by the Arzela-Ascoli theorem there is a sub- 
sequence {fe;} that converges uniformly on compact sets, We claim that 
{fe;} converges in JJ; for t < s. Indeed, for any R > 0, 


Ife. — fasll2 = | (1+ [é}?)'Ifes — Fayl?(©) de 
lélg Rh 
+f G++ Py, - FsP@ ae 
l€)2>R 
< [sue lh. — PCO] fc + lerntag 

l€lgk l€l<Rz 

eae f+ ley fhe a 

l€|>R 


< [ow Ii -A0©] [C+ le ae 
lelsk lels& 
+ (4 RY, fl 


Now, given ¢ > 0, since t— s < 0 and ||fz; — fi, ll, < 2C, we can choose 
R large enough so that the second term in this last expression is less than 
de for all i,j. But then for all sufficiently large i and j, the first term will 
also be less than be. Thus {f;;} is Cauchy in Hy. 1 


The hypothesis that Q is bounded can be relaxed when s > 0 (see 
Adams [1] and Lair [32]), but some restriction is necessary: see Exercises 
1 and 2. 

While we are considering the spaces H°(Q), here is another useful result. 


(6.15) Proposition. 
If Q is bounded and k is a positive integer, the norms f — ||f{lk and 
f > Dyajeax llO* Fillo are equivalent on H?(Q). 

Proof: By Theorem (6.1), it is enough to show that 


(19° fllo $C D> [l8%fllo for [6] < &. 


laj=k 
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Consider first the case k = 1. We shall prove a sharper result, namely that 
if there are constants a and 6 such that a < z, < 6 whenever x € 2, then 
IIfllo < (&- @)|In fllo for all f € 9(9). 

It suffices to assume that f € Co°(Q). Writing z = (2’,2,) where 
a! = (x1,...,Z,—1), we have f(x) = f°" On f(z',t) dt, so by the Schwarz 
inequality, 


vers 


a 


Lin oe b 
Janf(e!.o)P at f at <(b-a) [ \Onf(2', t)|? dt 


for x € Q. Integrating over Q, we obtain 


b b 
iis < 6-0) fff ionste',n[Patae! den 
= (b= a)* 12 fi. 


Now for the case k > 1, the above argument shows that ||0° flo < 
(b — a)||0,0° filo for |G] < k, so the desired result follows by induction. 1 


Our next result is a more precise version of Proposition (6.12) that we 
shall need later. We recall that if A and B are operators, their commutator 
[A, B] is defined to be AB — BA. In particular, if B is multiplication by a 
function ¢, we shall abuse notation slightly and write [A, B] as [A, d]: 


[A,é]f = A(¢- f) -6- Af. 
(6.16) Lemma. 
Ifs ER ando > in there is a constant C = C,,¢ such that for all € § 
and f E Ay-1, 
HIA*, A] fllo < CllP|ljs—aj4149llflls—s- 
Proof: Setting f = A!~*g, what we need to show is that 


II[A*, 6JA*~*allo < Cll¢llis—1}4142Ilgllo 


for all g € Ho = L?. Since the Fourier transform converts multiplication 
into convolution, we have 


((A*, )A! "9 (6) = | K(E,n)a(n) dn, 
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where 
K(E,m) = (1+ 1€P)°/? — (1 + Il?)°718(E = 0) + I?) 0-77. 
We claim that 


[(1 + [€[?)°/? - (1 + In}?)*/7) 


CON) Isle all + POP? + (1+ PO” 


Granted this, by Lemma (6.10) we have 


LK (E, mI S Isl LOC — a) |g — a [1 + ePO- P70 $I)? + 
< [s[2°-7/714(€ — nile = nl (+ lg — oP)? + 0) 
S Cald(€ — (1 + [= of?) ED? 


Thus f |K(é,)| dé and f |K(£,7)| dy are bounded by 


ca [aera + leryette a] pe | foster as] 
= Ca|llis—1)4140> 


so the lemma follows from Theorem (0.10). 
It remains to prove (6.17). Since 


SE] = [s0l(1 4 AC? g fal eV D?, 


by the mean value theorem of calculus we have 
\(1 + a?)*/? — (1 + b?)*/?| < Ja — b| sup |s|(1 +47)? 2)/? 
astcb 
< {s| [a — bj [(1+.a7)0-2/? 4 (1 + b7VC-Y/?] 


for all a,b > 0. Setting a = |é| and b = |y| and using the inequality 
ll — Inl| < I€ — nl, we obtain (6.17). ' 


(6.18) Proposition. 


Ifs € Rando > in there is a constant C = C,,, such that for all ¢ € § 
and f € H,, 


IIldflls < [sup \6(2| Hflls + Clldltis—rp4atellflle—r- 
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Proof: Since ||- {Jp is the Z? norm, by Lemma (6.16) we have 


[loflle = [IAA fllo < H6A? Fllo + {IEA*, 4) flo 
< [sup \6(2| {1A°fllo + Clldll—r4srcll flle—t 


r= 


< [our 16¢e)1] sll + Cll -atsatell fle. 

In our study of elliptic operators we shall sometimes wish to determine 
when certain derivatives of a function f € H, are also in H,. For this 
purpose we shall use Nirenberg’s method of difference quotients, which is 
based on the following simple result. 

First, some notation. If ¢ is a function on R” and z € R", we define the 
translate ¢, of ¢ by $:(y) = ¢(y+2). If f is a distribution, we then define 
the translate f, by the formula (f,, 4) = (f,¢-<). Next, let e1,...,€n be 
the standard basis for R®. If f is a distribution, h E R\{0},and1 <j <n, 
we define the difference quotient Ai f by 


Alfa ho" (fre; — f): 


We also introduce the following notation for products of difference quo- 
tients. If a is a multi-index and hh = {hjp:1 <j <n, 1<k<a;}isa 
set of |a| nonzero real numbers, we define 
n a ; 
An = II II Nii’ 
jrlk=1 
We also define |h|, the “norm” of h, to be 
n az 
Ih] = D> > layed. 
j=lk=l 
(6.19) Theorem. 


Suppose u € H, for some s € R and a is a multi-index. Then 


|O* fils = lim sup JA flls 
{h]|--0 


(whether both sides are finite or infinite). In particular, 0.f € H, if and 
only if Af f remains bounded in H, as |h| > 0. 
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Proof: For simplicity we shall present the proof for |aj = 1, ie., 
Ag = Aj; the argument in general is exactly the same. In the first place, 
. e2 tithes 1 . eine, Sin THE; > 
(As (@) = "fie = aierines STE Fey, 

Since |h7! sin whé;| < 7|€;| for all h and €;, clearly 
limsup||ASI2 < f (1 + el?) 2nigs0(@)P ae = lOsul2 


Moreover, since h~! sin thf; — 7€; as h — 0, it follows from the dominated 
convergence theorem that equality holds provided ||0;ul|, is finite. (We can 
even replace “lim sup” by “lim”.) On the other hand, if ||0;u|]; = 00, given 
any N > 0 we can find R > 0 large enough so that 


Joh + PY exits FON de > aN, 
lél<k 
which implies that for sufficiently small h, 
adsl? > xe + [EI?)"J2n~? sin whey /?1FCE)I? de > N. 
Thus ||A? fl], ~ 00 as A > 0. I 
We also need the following fact about difference quotients. 


(6.20) Proposition. 
If s € R and ¢ € §, the operator [Af, ¢] is bounded from H, to H,—jaj41 
with bound independent of h. 


Proof: First suppose |a| = 1, so Af = Aj. Clearly 


an (Of) ne; — of - [¢fne; — $f] 
f= h 


[Ai 4] = (Aid) fre;- 


Since the translations f — fre, are isometries on H, and Aii¢ converges 
smoothly to 4;¢ as h — 0, by Proposition (6.18) we have {[[Aj, é] fll, < 
C\lfll. with C independent of h. 

If ja| > 1, we can commute ¢ through the factors of Af one at a 
time, yielding [Ag, ¢] as a sum of terms of the form Ag [Ai, d]Ag., where 
a! +e; +a” = a. But by Theorems (6.3) and (6.19) and the result for 
jo| = 1, 

Ag 1A4, b) Ann flls—fol4a < IZ, Ann fl|s—joltta|4i 
S Cl Ate flle-totttart+1 
< Cll Flls-laltlo'|+lo"} +1 = Cllflls- ! 
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(6.21) Corollary. 

IfL= > pl<k ag0° is a differential operator of order k with coefficients 
in 8, then for any s € R the operator [A?, L] is bounded from H, te 
Hy_4-jo|41 with bound independent of h. 


Proof: Simply observe that AZ commutes with 0, so that [Ag, L] = 
S~[Ag, ag]0%, and apply Theorem (6.3) and Proposition (6.20). 1 


We now present an interpolation theorem for operators on Sobolev 
spaces. The proof, like the proof of the Riesz-Thorin interpolation the- 
orem for operators on L? spaces which it closely resembles, is based on the 
following lemma from complex analysis. 


(6.22) The Three Lines Lemma. 

Suppose F(¢) is a bounded continuous function in the strip 0 < Re <1 
which is holomorphic for 0 < Re¢ < 1. If |F(¢)| < Co for Re¢ = 0 and 
\F(¢)| < Ci for Re¢ = 1, then |F(€)| < C47°C? forReC =0,0<0<1. 


Proof: For € > 0, the function G.(¢) = C$*CypSe€C-O FC) sat- 
isfies |G.(¢)| < 1 for Re¢ = 0 or Re¢ = 1, and also |G,(¢)| — 0 as 
|Im¢} — co for 0 < Re¢ < 1. Applying the maximum modulus theo- 
rem on the rectangle 0 < Re¢ < 1, |Im¢] < R with R large, we see that 
IG-(C)| < 1 for 0 < Re < 1. Letting « — 0, we conclude that 


Co-*Cy IFO) = him |G-(¢)| <1 for Re¢ = a, 
which is the desired result. i] 


(6.23) Theorem. 
Suppose so < 5; and to < ty, and for¢ € C let 


s(Q)=(1—-C)sotesi, tC) =(1—-C)to+ Ctr. 
Suppose T is a bounded linear map from H,, to Ht, whose restriction to 
H,, is bounded from H,, to H,,. Then the restriction of T to Hyq) is 


bounded from H,(¢) to Hyg) for 0 < o < 1. More precisely, if ||Tf|lt) < 
Collflle. and (IT fll. < Crllflles, then ||TFllyo) $ Co~°CF IN Fllace)- 


Proof: Given ¢,y € §, let 


F() = (TAG | MOY). 
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Here A’ is defined for z € C just as in (6.4); for z = z+ iy we have 
A? = APA‘ and ||A* ||. = ||A*}lle = |[¢lls—2 since [(1 + |é[?)*¥/?| = 1. 
Also, since (1 + |€|*)#/? is an entire holomorphic function of z, it follows 
easily that F(¢) is an entire holomorphic function of ¢. The hypotheses of 
the theorem imply that when Re¢ = 0, 
IF) S$ ITA“ Sle MOV [to < CofA [Lao VII -t0 
= Col|A~**dffrollA*Y||-1. = Colldllollvllo, 


and likewise for Re¢ = 1, 


[F(Q) STATO Mle MOD. S Crllelloltdllo. 
Moreover, for Re€ = 0, 0< 0 <1, 
IF) STA P OPS [fell MOI to S CoA Alf s [AO Y||-t0 
= Colldllo(so-sipllPllo(r—to) S Colldlloll Piles —t0- 


Thus, by the three lines lemma, 
F(a) < Co~°CTIIdllollvllo, OS e< 1. 


But F(c) = (AX9TA~*(9)$ |p), and § is dense in Ho. Hence this estimate 
plus the self-duality of Ho implies that A“°)TA7~*(?) is bounded on Ho, 
and this in turn means that T is bounded from H,(,) to Hyg): 


IIT'Fllecoy = ACOTA~ A Filo 
S$ Cg7P CTIA Flo = CO~ CF Fllaco- 1 


As an application of Theorem (6.23), we obtain an easy proof of the 
local invariance of Sobolev spaces under smooth coordinate changes. 


(6.24) Theorem. 

Suppose Q and Q’ are open sets in R® and 0: Q + Q' aC®™ map with 
C® inverse. For any open set 2 with compact closure in Q', the map 
f + f 0 is bounded from H9(Q}) to H°(971(}4)) for alls € R. 


Proof: Let J : Q — Rand J : Q — R be the Jacobian deter- 
minants of © and O~!; thus J and J are C® and nonvanishing, and 
J(y) = [(J(@~1y)}-!. Since J is bounded away from zero on O71(9%) 
whenever 9 has compact closure in 9, for any f € H8(M) we have 


i FCW)? dy = / (f 0 ©)(2)?J(2) de > C i; I(f 0 O)(2)/? dz, 
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which proves the theorem in the case s = 0. In view of Theorem (6.1) and 
the chain rule, a similar calculation proves the theorem in the case where 
s is a positive integer. The case s > 0 then follows from Theorem (6.23). 
From this we obtain the case s < 0 by a duality argument. Let 24 be a 
neighborhood of &% with compact closure in 9’; then for all f € C2°(%), 


Ifo Olle = sup} | [irooyevateyaz 


9 €02(O-(M4)), [lall-» < i}. 


But for such f and g, we have 
[sooyaia@yae = [roo O-TH I) ay. 


That is, the adjoint of the map f ~ foOis g > J(g o@7!). The 
map g + go 7! is bounded from H?(O7'()) to HP(M) for t > 0 by 
the preceding argument, and the map h > Jh is bounded on AP (9) by 
Proposition (6.12) since J agrees with a Schwartz function on %. Hence 
g — J(g 0 @-?) is bounded from H2(O-!(M%)) to H2(2) ee > 0. 
Therefore, for s < 0 and f € C2°(), 


Ife Oll. < sup lll 20" lea 9 € OP(O), lll-+ <1} 
< Callflle 
which implies that f — f o© is bounded from H?(Q”) to H2(071(9")). I 


(6.25) Corollary. 
The map f > f 0 is a bijection from H!°*(2") to H'°<(Q). 


Proof: We use Proposition (6.13). Suppose f € H!°c(0'). If ¢ € 
CX(2'), then df € H°(Q”) for any precompact neighborhood 2” of supp ¢ 
in 2’, so (¢00)(f 08) = (Sf) oO € H,. But the map ¢ + $00 is clearly 
a bijection from C'°(9’) to C'9°(2), so this means that f o@ € H!°*(Q). 
The same argument, with © replaced by ©7!, establishes the converse. 1 


As a consequence of Corollary (6.25), the space H!°°(M) can be in- 
trinsically defined on ony C® manifold M, and the space H,(M) can be 
intrinsically defined whenever M is compact. Let us explain this in detail 
for compact hypersurfaces in R”, the only case we shall need. (For those 
who know about manifolds, this should be a sufficient hint for the general 
case.) 
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Suppose S is a compact C™ hypersurface in R". Then S can be covered 
by finitely many open sets U;,...,Ua¢ in R® such that form = 1,...,M 
there is a C@ map (with C©™ inverse) ©,, from Uj, to a neighborhood of the 
origin in R® such that On,(S9U,,) is the intersection of On(Um) with the 
hyperplane xz, = 0; we identify this hyperplane with R"~!. Let ¢,,...,¢n¢ 
be a partition of unity on S subordinate to the covering {U;,} (Theorem 
(0.19)). Suppose f is a function on S, or more generally a distribution on 
S (= a continuous linear functional on C(S)). We say that f € H,(S) if 
(Gn f)° O72 € H,(R"~*) for each m. We can define a norm on H,(S) that 
makes H,(S) into a Hilbert space by 


FI? = SO Gm) ° Oni I?. 
1 


If we choose a different covering by coordinate patches or a different par- 
tition of unity, the norm we obtain will be different from but equivalent to 
this one, by Theorem (6.24) and Proposition (6.12); hence the space H,(S) 
is well-defined, independent of these choices. 

If we combine this construction with Theorem (6.9), we immediately 
obtain: 


(6.26) Corollary. 

Let S be a compact C® hypersurface in R". If s > i, the restriction map 
f — f\|S from 8(R") to C™(S) extends to a bounded linear map from 
#,(R") to H,~(3/2)(S). 


EXERCISES 

1. Suppose 0 # ¢ € CO°(R") and {a;} is a sequence in R® with |aj| — 00, 
and let ¢;(x) = o(z — a;). Show that {¢;} is bounded in H, for every 
s € R but has no convergent subsequence in H; for anyt ER. 


2. In Exercise 1, replace ¢ by the point mass 6. Conclude that if s < —jn, 
the inclusion map H2(Q) — H?(Q) (t < s) is never compact unless Q 
is bounded. 


3. Fill in the details of the argument preceding Corollary (6.26) to show 
that the space H,(S) (S a compact C® hypersurface in IR”) is well- 
defined. 


4. Deduce from the result of Exercise 4 in §6A that if s > Zn and 6 € Hg, 
the map f — ¢f is bounded on A; for |t| < s. 
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C. Local Regularity of Elliptic Operators 


As the first application of the Sobolev machine, we shall derive the basic 
L? regularity properties of an elliptic operator L with C© coefficients. The 
method is first to prove estimates for the derivatives of a function u in terms 
of derivatives of Lu, assuming that these derivatives exist; such estimates 
are known as a priori estimates. One then uses these estimates to show 
that smoothness of Lu implies smoothness of u. 
Let 
L= \~ aa(z)d? 
lol<k 

be a differential operator of order k with C@™ coefficients a, (a qualification 
that will be assumed implicitly hereafter). We recall that L is elliptic at 
xo if Dlel=k aq(xo)E* # 0 for all nonzero & € R". In this case we clearly 
have 


(6.27) > Algl* 


a Aq(xo)E* 


lalSk 


for some A > 0, since both sides are nonzero and homogeneous of degree k. 
(A can be taken to be the minimum value of the left side of (6.27) on the 
unit sphere |¢| = 1.) Moreover, since the a,’s are smooth, if L is elliptic 
on some compact set V, the constant A can be chosen to be independent 
of a9 EV. 

Our first major result is the following a priori estimate. 


(6.28) Theorem. 

Suppose Q is a bounded open set in R" and L = })),)<;, 4a0° is elliptic on 
a neighborhood of %. Then for any s € R there is a constant C > 0 such 
that for all u € H9(Q), 


(6.29) lulls < C(lLulls—x + lIells-1). 


Proof: The argument proceeds in three steps: first we do the case 
where the a,’s are constant and ag = 0 for |a| < k, then we remove the 
restriction that the a,’s be constant for |a| = k, and finally we do the 
general case. 

Step 1. Suppose ag = 0 for |a| < k and ag is constant for |a] = k. 
Then for u € H, we can express Lu by the Fourier transform: 


Lu(€) = (2ni)* S* agile). 


Jol=k 
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Therefore, by (6.27), 
(1+ [€1?)*1@(E)? < 2° + [€1?)°-* (1 + (EI?) a)? 
< PAL + [€P) Lue? + 2 + EP) ALE). 
Integrating both sides and using the inequality ||u||,_~ < ||u{],_1, we obtain 


lulls < QF AT Luly, + 2* lull 


s-1) 


which gives 
Itulle < Co(||Lul]s—« + [lulle-1) 
with Co = 2*/? max(A~!/?, 1). 
Step 2. Assume again that ag = 0 for ja{ < k, but allow ag to be 
variable for ja| = k. For each zo € Q, let 


Ly, = Ss @q(xr9)0%. 


fal=k 


Since 2 is compact, we can take the constant A in (6.27) to be independent 
of zo € 2, so by the result of Step 1, 


(6.30) llulls $ Co(|Leoulls—e + Helle) 


for all u € H,, where Co is independent of rq. Our plan will be to estimate 


[|Lu —Leoullez =|] D> [aal-) — aa(zo)]O%u 


laj=k 


s- 


for u supported in a small neighborhood of zo, and then to write an arbi- 
trary u € H0(Q) as a sum of functions supported in small sets. 

There is no harm in assuming that ag € Co°(IR"), as we can multiply 
the ay’s by a smooth cutoff function that equals 1 on Q without affecting 
Lu for u € H9(Q). Thus there is a constant C, > 0 such that 


|ag(t) -—de(zo)| <Cile-—zol (jal=k, c ER”, zo EQ). 
Set 6 = (4(2an)*CoC1)7}, and fix ¢ € C°(Bo5(0)) with 0 < ¢ < 1 and 
¢ = 1on B,(0). 
Suppose u € H2(Q) is supported in Bs(zo) for some ro € 2. Then 


[aa(t) — aa(x0)]O% u(x) = Yro,0(2)0% u(x) 
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where 
Ye0,a(z) = $(z — 20)[4a(z) — aa(zo)]. 
We have sup, [%z,,0(z)| < Ci(25) = (2(27n)*Cp)-}, so by Proposition 
(6.18), 
Waa(-) ~ aa(z0)}O*ul|s—z < (2(2mn)*Co)-*JO%ulfs—z + Ca||O7ulls—e—1 
< (2n*Co)“* Jul]. + (24)*Callulle—s, 
where C2 depends only on ||zo,0(Z)||js—z-1}4n41 and so can be taken in- 


dependent of zo. Thus, since there are fewer than n* multi-indices a with 
Ja] < k, we have 


||Lu — Le, ullse < Ss, [fea(-) — aa(z0)]O%ulls—x 
lal<é 
< (2Co)~*|Julls + (2xn)* Collulls—1. 
Combining this with (6.30), 


llulls < Co(ILulls—e + [Leou— Lulls—s + |lulls—1) 
$< CollLulls—z + Fllulls + [(2xn)*C2 + 1]Collulls—s, 
so that 
Wlulls < Ca(I|Lul]e—z + {lulls—1) 
where C3 = 2[(24n)*C2 + 1]Co is independent of zo. Thus the estimate is 
established for u supported in Bs(zo) and is uniform in zo. 

Since Q is bounded, we can cover 2 by a finite collection of balls 
Bs(a1),.--,Bs(an) with 2; € 2. Let {¢;}" be a partition of unity on 
0 subordinate to this covering (Theorem (0.19)). Then for any u € H°(Q), 
¢;u is in H2(Q) and is supported in a ball of radius 6 for each j, so 


N N 
So Gul < So liGull 
1 4 1 


N 
SCs DULG ule + [Iejulle-2)- 
1 


lulls = 


But L(¢ju) = GLu+([L,¢;Ju, and [L,¢;] is a differential operator of or- 
der k — 1 with coefficients in CS° (check it!), so by Theorem (6.3) and 
Proposition (6.12) we obtain the desired estimate: 


N 
lulls < Ca So (lN¢j Lulls—e + ULE, Ci]ulle—e + IG; ulle—2) 


< Cs ([[Lulls—e + [lulls-2). 
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Step 3. Now consider a general L = L° + L!, where 


L° = 5. a,0%, B= » 40%. 


JaJ=k laf<k 
By the result of Step 2, we know that for all u € H°(Q), 
lulls < Ca(ILulls—e + lulls). 
On the other hand, as in Step 2 we may assume that the coefficients of L} 
are in C%°, so by (6.3) and (6.12) again we have ||Lul],-~ < Cs|lul],—1. 


Therefore, 
lulls < Ca(Lulle—a + []L*ulls—e + [lulls—1) 


< C(||Lulls—« + |lulls—1) 
where C’ = C4(Cs5 + 1). The proof is complete. | 


(6.31) Corollary. 
For any t < s — 1 there is a constant C; > 0 such that for all u € H9(Q), 


ilulls < Ce(l|Lulls—x + [lulls). 
Proof: By Lemma (6.11), for some C/ > 0 we have 
llulle-a < (2C)7*|Jul]. + Cellulle, 
where C is the constant in (6.29). Substituting this into the (6.29) yields 
[lulls < 2C(||Lul]s—z + Crllulle). ' 


We now use the Theorem (6.28) to prove the local L? regularity theorem 
for elliptic operators. 


(6.32) Lemma. 

Suppose Q is an open set in R” and L is an elliptic operator of order k 
with C® coefficients on Q. If u € HI°(Q) and Lu € H'%,,(Q), then 
ué HYS,(Q). 


Proof: According to Proposition (6.13), we must show that yu € 
H.41 for all y € C2°(Q). By hypothesis, pu € H, and pLu € Hs_z41. 
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Also, [L, ¥] {this notation was introduced before Lemma (6.16)] is a differen- 
tial operator of order k —1 with coefficients in C°(Q), so [L, plu © Asn 
by Theorem (6.3) and Proposition (6.12). Hence, 


L(pu) = pLut [L, plu € As-e41. 


We shall apply the method of difference quotients. If1<j<nandh#0 
is sufficiently small, the distributions Al (pu) are supported in a common 
compact subset 2! of 2, so we can apply Theorem (6.28) (on 2’) to them. 
Combining this with Corollary (6.21), we obtain 


[AL (Pu)le S C(LAZ (Pull. —e + AR (u)Ile-1) | 
S C(WAAL(Wu) lea + ILL, AR) (Hu) lee + AL (Hu)lle-1) 
< C(ALL(Yu)lls—e + C'lldulle + 14 (Yu)lle—2). 


Now apply Theorems (6.3) and (6.19), first to deduce that the right side 
remains bounded as h — 0, and then to conclude that 0;(#u) € H, for all 
j and hence that pu € Hs41. i] 


(6.33) The Elliptic Regularity Theorem. 

Suppose Q is an open set in R" and L is an elliptic operator of order k 
with C™ coefficients on 2. Let u and f be distributions on 2 satisfying 
Lu = f. If f € H'°°(Q) for some s € R, then u € H'°%,(Q). 


Proof: Given ¢ € C2°(2), we wish to show that du € H,44. Choose 
Yo € CS°(Q) such that %o = 1 on a neighborhood of supp ¢. By Corollary 
(6.8), pou € Hy for some t € R. By decreasing t if necessary, we may 
assume that N = s+k—d is a positive integer. Proceeding inductively, 
choose C™ functions ¥1,...,Wn—1 such that ¥,; is supported in the set 
where ¥;-1 = 1 and 4; = 1 on a neighborhood of supp¢. Finally, set 
on = ¢. 

We shall prove by induction that yju € H14;, and the Nth step will es- 
tablish the theorem. The initial case j = 0 is true by assumption. Suppose 
then that yju € Hix; where 0 < j < N. Then pyy1u = dyyidju € Wty; 
and L(yj;u) = Lu = f on supp %j41. Moreover, 


L(bjait) = L(bjaidju) = djaiL(oju) + (L, djail(yyu) 
= dail + (L, dj 4il(yysu) 
€ Hy + Higj—eai = Ata j—eti- 


By Lemma (6.32), yj4iu is in Hi9541(Q), hence in Hr4j41 since it is 
compactly supported. | 
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(6.34) Corollary. 
Every elliptic operator with C™ coefficients is hypoelliptic. 


Proof: If Lu = f € C™(Q), then f € H!°°(Q) for all s, so u € HI°¢(Q) 
for all s. By Corollary (6.7), u € C™(Q). | 


If L is elliptic and has analytic coefficients, then u is analytic on any 
open set where Lu is analytic. This can be proved by using Theorem (6.28) 
and keeping an excruciatingly careful count of the constants to show that 
the Taylor series of u converges to u; see Friedman [21]. A more illumi- 
nating method is to show that elliptic operators with analytic coefficients 
have analytic fundamental solutions — that is, if L is elliptic with analytic 
coefficients on , there is a distribution A(z, y) on QQ that is an analytic 
function away from {(z,y) : z = y} and satisfies L, K(z,y) = 6(z—y). One 
can then argue as in the proof of Theorem (2.27). For the construction of 
the fundamental solution, see John [29]. 

The analogues of Theorems (6.28) and (6.33) for the L? Sobolev spaces 
are valid for 1 < p < oo, but the proofs are deeper and require the Calderén- 
Zygmund theory of singular integrals. The corresponding L! and L™ esti- 
mates are generally false; in particular, it is not true that if Lu € C™(Q) 
then u € C™+*#(2). However, if 0 < a < 1 and Lu € C™+9(Q) then 
ué O™+*+2(0). (We proved this for L = A in §2C, and the proof in gen- 
eral is in the same spirit, relying on a generalized form of Theorem (2.29).) 
These results follow from the construction of an approximate inverse for el- 
liptic operators that we shall present in Chapter 8 together with estimates 
for pseudodifferential operators that can be found in Stein [46, §VI.5] or 
Taylor [48, §XI.2]. 


D. Constant-Coefficient Hypoelliptic Operators 


The ideas in the preceding section can be extended to prove hypoellip- 
ticity for operators other than elliptic ones. Indeed, in combination with 
some other techniques from algebra and functional analysis, they enabled 
Hormander to obtain a complete algebraic characterization of those op- 
erators with constant coefficients that are hypoelliptic. In this section we 
present Hormander’s theorem, not only as an elegant result in its own right 
but as a beautiful example of the interplay of different areas of mathemat- 
ics and as an application of Sobolev spaces in which the use of spaces of 
fractional order is crucial. (However, we omit the proof of one part of the 
theorem that is purely algebraic.) 
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We begin with some notation. It will be convenient. to dispose of certain 
factors of 277 that arise in Fourier analysis by setting 
1 a 
= Omit 
Every polynomial P in n variables with complex coefficients then defines a 
differential operator P(D): 
POS. dock" PID) = DF ee; 


lal<k lal<k 


1 1 5 
D= oni” 1.€., D; = ani and D 


and every constant-coefficient operator is of this form. If P is a polynomial 
and a is a multi-index, we set 


P(*) = Q*P, 
and we then have the following form of the product rule: 
1 
(6.35) P(D)[Fa] = D9 Gide sLP(D)al. 
lal20 


(The proof is left to the reader; see Exercise 1.) 

We shall need to consider complex zeros of polynomials. In what fol- 
lows, € and 7 will denote elements of IR" and ¢ will denote an element of 
C”. For any polynomial P we define 


Z(P) = {CE C* : P(C) = 0}. 


Z(P) is always unbounded when n > 1 (unless P is constant), because 
for any ¢1,...,¢,-1 € C, no matter how large, there exist values of ¢,, for 
which P((,...,¢n) = 0. For € € R”, we define dp(€) to be the distance 
from & to Z(P): 

dp(€) = inf{\¢ —€|:¢ € 2(P)}. 


Here, then, is Hormander’s theorem. 


(6.36) Theorem. 
If P is a polynomial of degree k > 0, the following are equivalent: 
(H1) [Im¢| — 00 as ¢ — 00 in the set Z(P). 
(H2) dp(£) ~ 00 as € > 00 in R". 
(H3) There exist 6,C, R > 0 such that dp() > Clé|® when € € R", |é| > R. 
(H4) There exist 6,C,R > 0 such that [P((E)| < Clé{-5!*!|P()| for all & 
when £ € R", |E| > R. 
(H5) There exists 6 > 0 such that if f € H'°°(Q) for some open 2 C R", 
every solution u of Lu = f is in H'9S,.(Q). 
(H6) P(D) is hypoelliptic. 
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Proof: Before beginning the labor of the proof, we make a few re- 
marks. First, our arguments will show that if P satisfies (H3) for a par- 
ticular 6, it satisfies (H4) for the same 6; and if it satisfies (H4) for a 
particular 6, it satisfies (H5) for the same 6. In fact, the optimal 6’s for all 
these conditions are equal. 

Second, some of the implications in the theorem are easy. A moment’s 
thought shows that (H1) and (H2) are equivalent, and clearly (H3) implies 
(H2). Moreover, (H5) implies (H6) in view of Corollary (6.7). 

We refer the reader to Hérmander (26] or (27, vol. II] for the proof that 
(H2) implies (H3), which is purely a matter of algebra. It requires some 
results from semi-algebraic geometry (the theory of sets defined by real 
polynomial equations and inequalities), specifically, the so-called Tarski- 
Seidenberg theorem. (These names should suggest to the reader that math- 
ematical logic is casting a shadow here. Indeed, Tarski and Seidenberg’s 
main concern was the construction of a decision procedure for solving poly- 
nomial equations and inequalities.) 

Taking the implication (H2) => (H3) for granted, then, to prove the 
theorem it will suffice to show that (H3) => (H4), (H4) => (H5), and 
(H6) => (H1). 

Proof that (H3) => (H4): We first claim that |P(é + ¢)| < 2*|P(€)| 
if |¢] < dp(€). To show this, consider the one-variable polynomial g(r) = 
P(€é+7¢). If |¢| < dp(€), the zeros 71,...,7% of g satisfy |7;| > 1, so as in 
the proof of Lemma (1.53), 


IPE+O] _l9D|_ a la- 
er fol = Hl 


Now, by applying the Cauchy integral formula to P in each variable, we 
see that 


a PE+¢) 
P((E) = fine sot ie i roe dC, ---dCn 


for any r > 0. If we take r = n~*/?dp(€) then |¢] = dp(€) when [G| = r 
for all j, so that |P(€ + ¢)| < 2*|P(€)|, and hence 


(a) a!2*|P(é)| 
POLS Faia QT 
Of course P(“) = 0 for |a| > k, so this estimate together with (H3) gives 
(H4). 
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Proof that (H4) => (H5): This argument is similar to the proof of 
Theorem (6.30), with an extra twist. Suppose f € H!°(Q) and Lu = f. 
Given ¢ € C%(Q), we wish to prove that ¢u € H;445. Choose po € C2°(2) 
with Wo = 1 on a neighborhood of supp ¢. By Corollary (6.8), wou € A; 
for some t € R, and by decreasing ¢ if necessary we may assume that 
t= s+k—1-— M6 for some positive integer M. Choose C™ functions 
w1,---;¢m inductively so that Ym is supported in the set where Ym—1 = 1 
and %m = 1 on a neighborhood of supp ¢, and let pagy1 = @. 

Since Yo = 1 on supp x, by (6.35) we have 


P(D)lvau] = Wi P(D)u + S> (Di) PO"D) you] 
a0 


€ A, + se Atek+lo| = Ai-k41. 
a#0 


That is, 
[A+ HPO UTEP de < co. 
Thus by condition (H4), 


fo + je [2)t E+ 1+ Slal| PCE) uy (€)/? dé < 00, 


which means that P()(D)[yiu] € Hi-n+i4sjaj- Next, since ¥; = 1 on 
supp #2, by (6.35) we have 


P(D) [Yau] = vaP(D)u + Sy (Dv) POD) vu) 
a#0 


EH, + ys Hy k+i4sjo| = At e414, 
a0 


so the argument above shows that P(*)(D)[pau] € He-n4146(04/a1)- Con- 
tinuing inductively, we find that 


PO (D)[vmu] € He-24146(m-14l a1)» 
so finally, for m= M +1, 
P(O(D) [gu] = PO (D)[vargru] € Ar-egigecmslat) = As tajay- 


Now, if P(E) = Djajce CaS", Pick @ such that |a| = k and cg # 0. Then 
P(@)(£) = alca $ 0, so du = (aleg)“1 P()(D) [gu] € Heyes, and we are 


done. 
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Proof that (H6) => (H1): Suppose P(D) is hypoelliptic. Let B = 
B,(0), and set 


N= {f € L?(R"): f =0 outside B and P(D)f = 0 in B}. 


(The elements of N are C© functions on the complement of OB, but they 
are in general discontinuous on 0B. For example, if g is any solution 
of P(D)g = 0 on a neighborhood of B then gxg EN.) If {fm} CN and 
fm — f in L?, then f = 0 outside B and P(D)f is a distribution supported 
on OB since the same is true of P(D)fm for all m; hence f € N. Thus N 
is a closed subspace of L?. 

Next, let B’ = B,/2(0). The elements of N are C® on B. in particular, 
if f EN then Dj f € L?(B’) for 1 <j <n. We claim that f + D;f|B' is 
a bounded linear map from the Hilbert space N to L?(B’), so that 


; ; x)? dx r)[? dz 
G3 Lf iaseracse fueiPde (Fe). 


By the closed graph theorem, it suffices to show that if fn — f in N 
and D;fm — g in L?(B’) then g = D;f|B’. But this is obvious: the 
hypotheses imply that D; fn — D;f and Dj fm — g in the weak topology 
of distributions on B’, so g = D; f on B’. 

Now, given ¢ € Z(P), let f(z) = e?**¢*yp(x). Then P(D)f(z) = 
P(c)e2"i&- 7 = (on B, so f EN. Moreover, Dj f(x) = Cje?"#4* for x € B, 
so by (6.37), 


Diet ff etomoede cc f etmords (Ce 2(P)). 
; B B 


If |Im¢| < R, this gives 


De Ine 4nR\z| dy 
US fe aeRil de” 


In other words, if ¢ varies in Z(P) but Im¢ remains bounded then ¢ remains 
bounded, which is precisely condition (H1). 5 


It may easily be imagined that the methods of the preceding section can 
be generalized to yield a hypoellipticity theorem for non-elliptic operators 
with variable coefficients, and this is indeed the case. Indeed, suppose 
L = 5 a,(x)D® is an operator with C™ coefficients on 2 C R”. If the 
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polynomials P,,(€) = S> aa(xo)E* (zo € ©) all satisfy the conditions of 
Theorem (6.36), and if these polynomials are all of comparable size in a 
certain precise sense, then L is hypoelliptic on 2. See Hormander [27, 
vol. II, §13.4]. However, this sufficient condition for hypoellipticity is far 
from being necessary in the variable-coefficient case. For example, the 
operator L = 0? + (dy + 20,)? is known to be hypoelliptic on R3, but the 
operator Ly = 02 + (dy +40,)? obtained by freezing the coefficients of L at 
a point (a,b,c) is not, because arbitrary functions of the form f(z, y,z) = 
g(z — ay) satisfy Lof = 0. The hypoellipticity of this Z follows from a 
beautiful theorem of Hérmander concerning hypoellipticity of operators of 
the form L = HEE - V)? + (Ao - V) where the A;’s are real C® vector 
fields; see Hormander (27, vol. IIT, §22.2}. 


EXERCISES 
1. Prove (6.35). (Hint: It suffices to take P to be a monomial.) 


2. Show that P(D) is elliptic if and only if P satisfies (H4) with 6 = 1, 
and that no P of positive degree satisfies (H4) with 6 > 1. 


3. Let P be a polynomial of degree k with real coefficients such that P(D) 
is elliptic on R®. Let Q(é,r) = 2mir — P(E), so that Q(Dz,D:) = 
&:—P(D). Show that Q satisfies condition (H4) (on R"*+?) with 6 = k7} 
but not for any 6 > k-!. (Hint: Consider the regions |€|* < |r| and 
\é|* > |r| separately.) 


E. Sobolev Spaces on Bounded Domains 


If Q is a bounded open set in R", we define the Sobolev space H,(%) for k 
a nonnegative integer to be the completion of C~(f) with respect to the 


norm 
1/2 
[iors] ; 
2a 


By Theorem (6.1), HP(&) is the completion of C&°(Q) with respect to the 
same norm, so we have H?() C Hx(Q) C H}es(). 

(If Q is a bounded domain with smooth boundary, it is possible to 
develop a theory of Sobolev spaces H,() of arbitrary real order. One 
defines H_,(Q) to be the dual of H?(Q) when k is a positive integer, and 


(6.38) Ivika= | 


lalsk 
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then defines H,(Q) for nonintegral s by an interpolation process; see Lions 
and Magenes [34]. However, we shall have no need of this refinement.) 

The following basic properties of H,({) are obvious from the defini- 
tions: 


i. If j < k, then || - lj, < fl - [lea and H,(Q) is included in H;(Q) as a 
dense subspace. 


ii. If ja] < &, 0% is a bounded map from H,(Q) to Hy—ja}(). 


iii. If 6 E C~(N), the map f — ¢f is bounded on Hy(Q) for all k. (This 
follows from the product rule for derivatives.) 


iv. H,(Q) is invariant under C® coordinate transformations on any neigh- 
borhood of 2. (This follows from the chain rule.) 


v. The restriction map f — f{Q is bounded from H;(R") to H,(Q). (This 
follows via Theorem (6.1) from the estimate fy [f|? < faa lfl?-) 


Our definition of H;,,(Q) is designed to trivialize the problem of ap- 
proximation by smooth functions. However, it would also be reasonable to 
consider the space of all functions on 2 whose distribution derivatives of 
order < k are in L?(Q), which we denote by W;(&): 


Wi (2) = { f € L2(Q) : A° Ff € L7(Q) for lal < k}. 


W,(&) is a Hilbert space with the norm (6.38). In general, H,(9) is a 
proper subspace of W;({2); see Exercise 1. However, if 02 satisfies a mild 
smoothness condition, the two spaces coincide. 

Specifically, we shall say that a bounded open set 2 has the segment 
property if there is an open covering Uo,...,Uy of 2 with the following 
properties: 

a. Uo Cc Q; 

b. Uj; NAAQS @ for j > 1, 

c. for each j > 1 there is a vector y € R” such that «+ 6y ¢ © for all 
xe€U;\Qand0<6<1. 

See Figure 6.1. In particular, if Q is a domain with a C) boundary, then 

has the segment property; see Exercise 2. 


(6.39) Theorem. 
If Q has the segment property, then H,(Q) = W,(Q). 


Proof: We need only show that W,(Q) C Hz(Q). Let Uo,...,Un be 
an open covering of Q as in the definition of the segment property. Choose 
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Figure 6.1. The segment property. The dotted lines represent the 
segments x +ty,0<t <1, for various z € S = @Q, and Ss; = 
{x + dy :2 € S}. 


another open covering Vo,..., Vw of % such that V; C U; for all j, and 
let {¢;}” be a partition of unity subordinate to the covering {V;}}’. If 
f € W,(Q), then clearly Gf € W,(Q), and it is enough to show that 
Gif € Ay(Q) for all j. 

For j = 0 this is easy. Choose ¢ € Co°(B,(0)) with fd = 1, and 
set (x) = e~"d(e7'x). Then (Cof) * de is C and supported in 2 for € 
sufficiently smal], and O° [(Cof) #¢<] = O%(Cof) * be > O° (Cof) in L?(Q) as 
€-+ 0 for |a| < &, by Theorem (0.13). Thus Gof € H?(Q) C Hy(Q). 

Writing f instead of ¢; f, then, it suffices to assume that f is supported 
in some V;, 7 > 1, where we extend f to be zero outside 2. Set S = 
GQNV;. Then f and its distribution derivatives of order < k agree with 
L? functions on R®\ S. For 0 < 6 < 1, define f°(z) = f(z — 6y’) and 
So = {r+ by : x € S}, where y is as in the definition of the segment 
property. Then f* and 0° f° (|a| < k) are in L? on R"\S®, f° is supported 
in U; for 6 sufficiently small, and (Ss = @. It follows easily from Lemma 
(0.12) that 


> [joes - arr? +0 2550, 


laisk 


so it is enough to show that f*[Q € H,(Q). Given 6 > 0, choose ¢ € C? 
such that ¢ = 1 on 2 and g = Onear S®. Then ¢f* € A; (R"), so df? is the 
limit in the Hy norm of functions in S(R"). It follows that f°|Q = ¢f*|Q 
is the limit in the norm (6.38) of functions in C*(Q), ie, f € Hi(M). 1 
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(6.40) Corollary. 
If Q has the segment property, then f € Hy+4;(Q) if and only if d°f € 
H,(Q) for |a| < j. 


Proof: The assertion is obvious if Hy4; and H; are replaced by Wy 4; 
and W,. i 


We now derive a useful construction for extending elements of H;,() 
to a neighborhood of 2 when Q is a domain with smooth boundary. The 
starting point is the following gem of linear algebra. If a1,...,a,, are 
complex numbers (or elements of an arbitrary field), the Vandermonde 
matrix V(a1,...,@m) is the m x m matrix whose jkth entry is (a;)*~!, 
1<j,kim. 


(6.41) Lemma. 
det V(a1,...,@m) = Thhejcrem(oe — aj). In particular, V(a,,...,@m) is 
nonsingular if and only if the a;’s are all distinct. 


Proof: Clearly det V(a;,...,am) = 0 if any two a;’s are equal, since 
then two rows of V(a1,...,@m) are equal. So suppose @1,...,@m~1 are dis- 
tinct, and regard det V(a;,...,@m) as a function of am. It is a polynomial 


of degree m— 1 that vanishes when a,, = a; for some j < m, hence equals 
eTTjy' (am ~a;). Here c is the coefficient of (an)"~*, which is nothing 
but det V(a,,...,am-1). The proof is therefore completed by induction. 1 


Returning now to Sobolev spaces, we introduce a notation that will be 
used repeatedly in the sequel: 


(6.42) N(r) = {x ER": |z| <r and x, < 0}. 


(6.43) Lemma. 

Let B, = B,(0), let Ck(B,) be the space of C* functions supported in B,, 
and let Ck(N(r)) be the space of C* functions on N(r) that vanish near 
|x| = r. For each positive integer k there is a linear map Ey : Ck(N(r)) — 
CK(B,) such that Exf|M(r) = f and |\Exflly < Cillflly.mce) for 0S 5 Sk, 
where C; is independent of f andr. 


Proof: By Lemma (6.41) the matrix V(—1,...,-#—1) is nonsingular, 
so there exist numbers c1,...,cx41 such that 
k41 


Y-)™r =1,  (m=0,...,k)- 


t=1 
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We define the map Ey on C¥(N(r)) by 


; _ f f(z) (tn <9), 
Er f(z) = { “ae cif(t1,-.-,%n-1,—ltn) (tn > 0). 


Then for z, > 0 and Jal < k we have 


k+1 


O° Ex f(z) = D(-N* (97 fai...) tn-1s—l2n) (tn > 0), 
i=1 


so the limits of 0° Ef as x, approaches zero from above or below are equal. 
Also, it is clear that E;,f is supported in B,, and hence Exf € Ck(B,). 
Finally, 


k+1 
[rma s fied retar| [mies 
B, ae N 
whence ||Ex fll; < Cyllfiljwcry with Cj independent of f and r. i] 


(6.44) Theorem. 

Let 2 be a bounded domain on R" with C® boundary, and let Q bea 
bounded neighborhood of 2. For each positive integer k there is a bounded 
linear map Ex : Hy(Q) > HY 9(Q) such that Ex f|Q = f. Ey is also bounded 
from H;(Q) to H9(2) forOQ<j<k. 


Proof: Let Vo,..., Vy be an open covering of 2 with the following 
properties: (i) Vm C © for all m, (ii) Vo C Q, Gii) for m > 1, Vm can 
be mapped to a ball B,(0) by a C@ map yp, with C™ inverse such that 
m(Vm 2) = N(r). Choose a partition of unity {Gn} on 9 subordinate 
to this covering, and define Ex f for f € C*(Q) by 


N 
Exf = Cof + Do[Ee (mS) © vn) ohm, 


m=1 


where the Ey on the right is given by Lemma (6.43). Then E;f is C* and 
is supported in ©, hence is in He). From Lemma (6.43) and the product 
and chain rules it follows that ||Exf|l; < C;ilfllj,a for 7 < k, so Ex extends 


uniquely to a bounded map from H;(2) to H9(Q). ' 


Remark: By refining the argument in Lemma (6.43) one can con- 
struct an extension operator E,. that works for all & simultaneously; see 
Seeley [44]. 
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With the aid if the extension map FE; we can easily obtain analogues 
of some of the major results of §6A and §6B for the spaces H;(Q). 


(6.45) The Sobolev Lemma. 
If Q is a bounded domain with C® boundary and k > j + in , then 
H,(Q) c C3(0). 


Proof: If f € Hi(Q) then Exf € H$(Q) C Hk(IR") C Ci(R"), 80 
fecha. ‘ 


(6.46) Rellich’s Theorem. 
IfQ is a bounded domain with C©® boundary and 0 < j < k, the inclusion 
map H,(Q) + H;(Q) is compact. 


Proof: If {fi} is a bounded sequence in Hi(Q), {Ex fi} is bounded 
in Hf(2), so a subsequence {Ey fi,,} converges in H}((), and then {fi} 
converges in H;(). i] 


(6.47) Theorem. 

If Q is a bounded domain with C®@ boundary and k > 1, the restriction 
map f — f|S from C*() to C*(S) extends to a bounded map from H;(Q) 
to Hy-(1/2)(0Q). 


Proof: This is immediate from Corollary (6.26), since E,f|@Q = 
FAQ for f € C*(Q). i] 


(6.48) Corollary. 
If f € Hi(Q), then 8° f|GQ is well defined as an element of L?(8Q) for 
la[<k-1. 


(This is also an easy consequence of Corollary (2.40).) 


(6.49) Corollary. 
If f € HP(Q) then O° f[8Q = 0 for lal < k —1. 


Proof: This is true for f € C>°(9), hence in general since C2°(Q) is 
dense in H?(Q). 1 


The converse of Corollary (6.49) — that if f € H,(Q) and 8% f|AN = 0 
for jal < k—1 then f € H£(Q) — is also true. The full proof is a 
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bit technical; it can be found in Adams [1, Theorem 7.55] or Lions and 
Magenes (34, Theorem 11.5]. We shall content ourselves with proving the 
following slightly weaker result. 


(6.50) Proposition. 
Suppose 2 is a bounded domain with C® boundary. If f € C*(Q) and 
O° f =0 on AQ for 0 < lal < k —1, then f € HP(Q). 


Proof: We use the same construction as in the proof of Theorem 
(6.39), noting that 2 has the segment property (Exercise 2). By using a 
suitable partition of unity, it suffices to assume either that f is supported 
in Q or that f is supported in a set V for which there exists y € R" such 
that «+ dy ¢ D for z € V\Q and 0 < 6 < 1. The first case is obvious: 
just convolve f with a suitable approximation to the identity to obtain it 
as the limit of elements of C°°(Q). For the second, we extend f to be 
zero outside 2; then f € C*~}(IR") and the k-th derivatives of f have only 
jump discontinuities along 09, so f € Hi(IR"). Define f°(x) = f(x + dy) 
for 0 <6 <1. Then f° is supported in 9, so f® € H2(Q) as above. But 
clearly f* — f in the Hy, norm as 6 —+ 0, so f € HP(O). 1 


Finally, we shall need the analogues of Theorem (6.19) and Proposition 
(6.20) for H,(N(r)), where N(r) is defined by (6.42). We note that if f is 
a function on N(r) that vanishes near |x| =r, then the same is true of the 
difference quotient Ap f for h sufficiently small, provided a, = 0. 


(6.51) Theorem. 

Suppose f € H,(N(r)) and f vanishes near |x| = r, and let w be a multi- 
index with a, = 0. Then the distribution derivative 0% u is in Hy(N(r)) if 
and only if 


A(f,a) = aa sup ||An flew < 00, 


in which case ||O% f\lk,n(r) < CeA(f,@) where C, is independent of f and 
r. 


Proof: Let Ey be the extension map of Lemma (6.43). If O%f € 
Hy,(N(r)), then E,0°f € H,. But O° E, f = E,O°f and ARE, f = E,Agf 
when a, = 0, so by Theorem (6.19), 


A(f,e) < se sup [An Ex fll = l]0° Ex fll < 00. 
—0 
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Conversely, if A(f,a) < oo, then 


Him sup [Ag Ee file = lim sup [Ie As file < CrA(f, a) < 00, 
—0 0 


where C; is the constant in Lemma (6.43). Hence 0°E, f € Hz, so O°f € 
Hy,(N(r)) and 


O° Flle,wr) S MO" Le flle < Ce Af, 2). ‘ 


(6.52) Proposition. 
Suppose ¢ € Cc™(N(r)) and a is a multi-index with a, = 0. There is a 
constant C such that for all f € Hy(N(r)) which vanish near |x| = r and 
all sufficiently small h, 


AR, dlfllewery $C D> 10" flle,wer)- 


Ba 


(Here “G < a” means that B; < a; for all j with strict inequality for at 
least one j.) 


Proof: Same as the proof of Proposition (6.21). i] 


EXERCISES 


1. Let Q = {re?? € C = R?: ~a < 6 < x} (the unit disc with a line 
segment removed), and define the function f on 2 by f(re®?) = 0 € 
(—7, 7). Show that f € W,(Q) \ Ai(Q). 


2. Show that if 2 is a bounded domain with C! boundary, then 2 has 
the segment property. (Hint: Take the sets U; for 7 > 1 to be small 
balls centered at suitably chosen points 27 € OQ, and y’ to be a small 
positive multiple of the unit outward normal at 2.) 


Chapter 7 
ELLIPTIC BOUNDARY VALUE PROBLEMS 


Let L be an elliptic differential operator with C™ coefficients and let N be 
a bounded domain in R” with C® boundary S. In this chapter we shall 
study the equation Lu = f on Q, where u is to satisfy certain boundary 
conditions on S, the object being to prove existence, uniqueness, and regu- 
larity theorems. Our approach will be to formulate the problems in terms 
of sesquilinear forms that generalize the Dirichlet integral of §2F and then 
to apply some Hilbert space theory. For many of these problems a version 
of Dirichlet’s principle holds — that is, solution of the problem is equiva- 
lent to minimizing the generalized Dirichlet integral in a suitable class of 
functions — and for this reason, boundary value problems set up in terms 
of Dirichlet forms are often said to be in “variational form.” However, the 
calculus of variations will play no direct role in our work. 


A. Strong Ellipticity 


In Chapter 2 we saw that we obtain a “good” boundary value problem 
for the equation Au = f by specifying either u or 0,u on the boundary. 
In general, for an elliptic operator of order k it turns out to be appropri- 
ate to impose ik independent conditions on the Cauchy data of u on the 
boundary. 

This causes problems if k is odd. For example, consider the Cauchy- 
Riemann operator 


= (0, + idy) 


on 2 C R?. Solutions of G-u = 0 are holomorphic functions of z = x + ty 
on 2, so the Dirichlet data (u = f on S = OM) are overdetermined: not 
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every function f on S extends holomorphically to 2. For example, if Q is 
the unit disc, a function f(e™?) = Dae ame'™® on the unit circle extends 
holomorphically if and only if a,, = 0 for m < 0, in which case the extension 
is u(z) = 05 a@mz™. Thus in some sense we can prescribe “half of the 
Dirichlet data” of u — namely, the Fourier coefficients a,, with m > 0. 
Fortunately, elliptic operators of odd order are not common: 


(7.1) Proposition. 
Let L = Dyaj<z 400% be elliptic at ro € R". If n > 3, or ifn = 2 and the 
numbers a@q(xq) are real for |a| = k, then k is even. 


Proof: Ellipticity means that the function 


o(€) = So aa(z0)é* 


la[=k 


is nonvanishing on R” \ {0}. For each nonzero é’ = (€1,...,€n-1) € R"~} 
let us consider the polynomial of degree k 


Pe (z) = o(€1,--.,€n-1,2) (z EC). 


Let Ny (€') {resp. N_(é’)] be the number of zeros of Pg: with positive [resp. 
negative] imaginary part. Since Pe:(x) # 0 for z real, Ny(é’) + N_(€’) =k 
for all é'. Also, it follows from Rouché’s theorem that Ny and N_ are 
locally constant on R"~! \ {0}. For n > 3 this set is connected, so Ny 
and N_ are constant. But N4(—é’) = N_(é') since o(—€) = (—1)F o(€), so 
Ny = N_, and hence k = Ny + N_ = 2N, is even. 

On the other hand, suppose n > 2 and the numbers a,(zo) are real. 
Since o(€) is nonzero for € € R"\{0} and the latter set is connected, o must 
be everywhere positive or everywhere negative. Since o(—€) = (—1)*a(€), 
this forces k to be even. a 


It is also possible for strange things to happen with operators of even 
order. Consider the following example, due to A. Bitsadze: The operator 


02 = 4 (02 + 210, dy — 97) 
is elliptic on R?, and the general solution of 02u = 0 is 


u(z,y)=f(z)+7%9(z)  (z=2+iy) 
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where f and g are holomorphic functions. In particular, if we choose f 
holomorphic on the unit disc B and continuous on B, and set g(z) = 
—zf(z), we see that u = (1 — |z|?)f(z) solves the Dirichlet problem 


G2u = 0 on B, u=0on @B. 


Since there are many such f’s, the solution of this Dirichlet problem is far 
from unique, and we can say virtually nothing about. the smoothness of the 
solution along the boundary. 

We therefore introduce a restricted class of elliptic operators which will 
exclude such pathological cases. We say that the operator 


L= > a,0* 


lolse 


(with C® coefficients, as usual) is strongly elliptic on 2 if there is a C™ 
complex-valued function y of absolute value 1 on 9 and a constant C > 0 
such that 


Re| (2) DD sa(2)e°| >Clel* (€ER", rEN). 


laj=k 


It follows that any elliptic operator with real coefficients on 9) is strongly 
elliptic on 9 (take y = £1), and that a strongly elliptic operator has even 
order k = 2m (because (—£)* = (-1)leléey, Since the equation Lu = f 
is equivalent to the equation (—1)"yLu = (—1)"yf, by replacing L by 
(—1)"yL we may assume that 


(7.2) (-1)™Re J? aa(zew>ClEP™ (EER, 2 EN). 
{a|=2m 

Henceforth, when we speak of a strongly elliptic operator we shall assume 

that this normalization has been made. (In particular, this means replacing 

the Laplacian A by —A. The significance of the factor of (—1)” will appear 

in the next section.) 


EXERCISES 
1. Show that the Bitsadze operator 2 is not strongly elliptic. 


2. Let (z,y,wi,..-,Wn) be coordinates on R"+?, and let L be a second 
order operator in the variables wi,..., Wn. 
a. Show that the operator 02 + L (z = x + iy) cannot be elliptic at 
any point of R"+?, 
b. For which L is 0? + a + LE elliptic on R"+?? 
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B. On Integration by Parts 


Henceforth Q will denote a bounded domain in R" with C™ boundary S, 


and 
B= > a,0% 


la|<g2m 


will denote a strongly elliptic operator on © satisfying (7.2). Also, (v|u) 
will denote the inner product of v and u in L?(Q), ie., (v|u) = fq vt: 

The formal adjoint of L is the differential operator L* on © deter- 
mined by the formula 


(L*v|u) = (v| Lu) for all u,v € CO°(Q). 


(This differs from the dual of ZL that we used in defining the action of L on 
distributions in that we are using Hermitian inner products.) Working out 
the integration by parts implicit in this formula, we see that 


Lv= S> (-1)la7 (a0). 


la}<¢2m 


In particular, the 2m-th order part of L* is islisan @,0%, so L* is strongly 
elliptic whenever L is. 

Frequently it will be convenient to integrate by parts only m times to 
obtain an expression 


(vj Lu) = S>  (8%v|aepd%u). 


le [4lgm 


In general, a sesquilinear form (linear in the first variable, conjugate-linear 
in the second) of the type 


(7.3) D(v,u) = a (8%v | dap’ u) (dag € C™(M)) 
lel, lalgm 


is called a Dirichlet form of order m. D is called a Dirichlet form for 
the operator L if 


D(v, u) = (v| Lu) for all u,v € CO°(Q). 


We have already met in §2F the special case of this idea which historically 
provided the inspiration for the general construction, namely the Dirichlet 


form 
nn 


Div, x) = S>(8;013ju) = f Vo Tu 


1 
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for —A. 
Any operator L has many different Dirichlet forms. For example, in 
dimension n = 2, another Dirichlet form for —A is 


D'(v, u) = ((de + id, )v| (Be + id, )u) = 4(8z0 | Oru). 
For another example, two Dirichlet forms for A? on R? are 
Di(v,u) = (Av} Au), 
Da(v,u) = (02 — a2)v | (82 — 02)u) + 4(8e0)0 | Oe Oyu). 
The choice of Dirichlet form will be partly determined by the boundary 
conditions we wish to consider. 


The Dirichlet form (7.3) is said to be strongly elliptic on © if for 
some constant C' > 0, 


Re D> aap(zyettP > Cle?" = (€E R", 2 EN). 
lej=[6|=m 
If D is a Dirichlet form for L = 5 a,0%, it is easy to see that 


ay=(-1)" $7 dap (ly = 2m), 
a+p=7 
and it follows immediately that L is strongly elliptic and satisfies (7.2) if 
and only ifevery Dirichlet form for L is strongly elliptic. (This is the reason 
for the factor of (—1)™ in (7.2).) 

We now have to face the task of determining what happens to the 
formula {v| Lu) = D(v,u) when v and u fail to vanish near the boundary. 
For this we need some terminology. 

Let J be a finite set of nonnegative integers. A set {Mj }j¢, of differen- 
tial operators defined near S = 02 will be called a normal J-system on 
S if the order of M; is j and S is non-characteristic for each Mj. (The most 
obvious example is the system {0};¢7 of powers of the normal derivative 
defined on a tubular neighborhood of S by (0.3).) If & and / are nonnegative 
integers with | > k, we shall denote the set {k,k +1,...,0} by [k,l]. 

What we wish to prove is the following: 


(7.4) Theorem. 

Let D be a Dirichlet form of order m associated to the operator L on 
Q. Given a normal [0,m — 1]-system {Mj} on S, there exist differential 
operators N; of order j, m <j < 2m— 1, defined near S such that 


m-1 
D(v,u) — (v| Lu) = >> i (M;v)(Nom—1-) do. 
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The proof is straightforward but tedious, and we shall leave some of the 
details of the calculations to the compulsive reader. It proceeds in three 
steps, working up from the simplest special case to the general one. To 
begin with, we set 

X~ ={zER": 2, <0}, X° = {rE R": 2, =O}. 
(7.5) Lemma. 


Let D be a Dirichlet form for L on X~. There exist differential operators 
N; of order j, m <j < 2m-—1, defined near X° such that 


pow) fat =S [eon 


for all v,u € C™(X~) of which at least one vanishes for |z| large. 


Proof: Integration by parts shows that 
/ (auju+ [ v(0;%) = 0 for j <n, 
x7 x- 


[Gua i: 007) = f vt, 


and repeated application of these formulas yields 


Oye — (— jo] aa 
[rem oe! [warm 
== [caer -ate'm, 


where a’ = (a1,...,@,~1). Now let D be given by (7.3). Replacing u by 
a960°u and summing over a and (, we obtain 


m-1 
D(vu)— fo oT) = > feb) Wana) 
gj=0 


where 


Nominiu= (Hale taleeeI-Vfagg dul. 
lel,J8l<m; an<j+l 
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(7.6) Lemma. 
let D, L, X~, X° be as above. If {M;} is a normal [0,m — 1)-system on 


X°, there exist differential operators Nj of order j, m <j < 2m—1, such 
that 


m-1 
Ce i o(Eu) = So i (Mj)(Nigna 8) 
x- 0 vx 
for all v, u € C®(X—) of which at least one vanishes for |x| large. 
Proof: The assumption on the M; means that we can write 


B= bjMj+ Y> — bajd* — (bj, bas E C®). 


lal<j, an<j 


If {Nj} are the operators of Lemma (7.5), by integration by parts on X° 
we find that 


[RVR maae = f Mj )G Nam 4 Gam: 


7=0 
where 
, : i= 
Abm=1-it = om [> (-1)!¢ lge [8¢e!,i); Nem—1-j 4] 
la‘|<j-1 
and again, a’ = (aj,...,@n~1). Clearly Abs 1-; is a differential operator 


of order 2m—1~—7. ‘Applying this formula with 7 = m — 1 to the result of 
Lemma (7.5), we obtain 


D(v,4)< fe: v(La) = fe (Mm10)(NEx) +> ee (a4.v)(NE,a8); 


$=0 


where N} = bmNm and NB y_; = AMT» + Nom-1-i for i< m-—1. 
Now repeat this argument with m replaced by m— 1 and Nj; replaced 


by N}, obtaining a formula 


m—2 : 
ef eoE a) 
#=0 


= | (Mn-20 (Mae) + y ae (a5 v)(NE,, a), 


#=0 
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whence 


D(v, u) - fe i) = fm rv)(WEH) +f (Mm-20) RR ¥) 


m-3 : 
+0 [GR a). 


i=0 
Continuing this process inductively, after m steps we obtain the desired 


result with Ni4; = Nae ] 


Proof of Theorem (7.4): Let Vo,...,Vy be an open covering of 2 
such that Vo C 2, Vi NS # @ for i > 1, and for each i > 1, 


ViNQ= {ze Ve ar< fei, +:BIH1,2141)---12n)} 


for some ! and some smooth function f;. Let ¢o,...,¢n be a partition of 
unity on 9 subordinate to this covering. We write 


N 


D(v, u) — (v| Lu) = YL [DO, Gu) — (v| L(Gu))]. 


0 


Clearly D(v,¢gu) = (v|Z(Cou)). For ¢ > 1, make the coordinate transfor- 


mation 

zj for j < l, 

w= {5s forl<j<n, 

zi—fi(t1,...) forj=n, 
which takes V; MS to a portion of the hyperplane X° and V;NQ toa 
portion of the half-space X~. The operator L assumes a new form J, in 
these coordinates, and the system {M]j;} becomes a normal system {;M;} 
on X°. Moreover, | det(Oy; /Oz%)| = 1, so L? inner products are preserved 
and D is transformed to a Dirichlet form D,; for L;. Applying Lemma 
(5.6), we obtain for each i a system {;Nj} (m <j < 2m— 1) of differential 
operators defined near y, = 0 such that 


Da(vs Gen) = (v|La(Gen)) = Of (eM CNp ag). 


Transforming back to the original coordinates, ;Nj becomes an operator 
iN; defined near V; NS, and we have 


m—-l 
Dw, Gen) ~ (v | E(Ce0)) = Yo f (Mj 2)(NamiiGa)) do, 
j= 
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where ;Nj; is iN}! multiplied by the change-of-measure factor from Eu- 
clidean measure on X° to surface measure on S. Therefore, if we set 


Nju= So iNj (Gu), 
1 


N; is a differential operator of order j defined near S, and it fulfills the 
conditions of the theorem. i] 


Remark: A close examination of the proof shows that if D is strongly 
elliptic, {N;} is a normal [m, 2m — 1]-system on S. 


We need one further result along these lines: 


(7.7) Proposition. 
Let {M;} be a normal (0, k]-system on S. Given fo,..., fr € C~(S), there 
exists w € CO°(R") such that M;w = f; on S. 


Proof: First suppose M; = 0). Let V be the tubular neighborhood 
of S given by (0.2), and choose ¢ € C2°(R) with ¢(t) = 1 for |t] < fe and 
(t) = 0 for |t| > $¢. We can then define w on V by 


k ti 
w(x +tv(x)) = o(t) > Fi) 


and set w = 0 outside V. 
In general, we have 


i 
M; = > as; (z, aya, 
0 


where a;;(z, Q) is a differential operator of order j~i which acts tangentially 
to S and a;;(z,0) = aj;;(z) never vanishes. Hence M;w = f; on S if and 
only if 


j-l 
Abw(2) = os | te) -— Dante debw(e)]. 
a;;(z) $20 
But the quantity on the right is completely determined by the f;’s and 
their derivatives on S, by induction on j. Namely, for j = 0 it is fo/aoo; 
for j = 1 itis 


aah — ao;(z, dw] = re [A — 49;(z, 0) (£)] ; 
tt ayy 


ao0 


and so forth. Thus the problem is reduced to the case M; = 83, which we 
have solved. 1 
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C. Dirichlet Forms and Boundary Conditions 


We now construct a general scheme for setting up boundary value prob- 
lems in terms of Dirichlet forms. To fix the ideas, let us first consider the 
Dirichlet problem (with homogeneous boundary conditions) for a strongly 
elliptic operator L of order 2m on Q. This is roughly the following: Given 
a reasonable function f on Q, find a function u satisfying Lu = f on Q and 
diu=00n S = AQ for 0 <j <m. Our definition of “reasonable” will be 
f € Ho(Q) = L?(9), and we shall accordingly allow u to be a distribution 
solution of Lu = f. If f is sufficiently smooth, the Sobolev lemma and the 
local regularity theorem (6.33) will guarantee u to be a classical solution. 

To start out, we must search for u in a class of functions for which 
the boundary conditions make sense, and in view of Corollary (6.48) the 
natural candidate is Hy,(Q). Moreover, since the condition 0Ju = 0 on S 
for 7 < m implies that all derivatives of u of order < m vanish on S, by 
(6.49) and (6.50) a reasonable interpretation of the Dirichlet conditions is 
that u should be in H°,(Q). We therefore reformulate the Dirichlet problem 
as follows: Given f € L?(Q) = Ho(Q), find u € H(Q) satisfying Lu = f 
on 2. 

This is still somewhat awkward, since if u € H°,(Q) then Lu is a priori 
only known to be in H_m(IR"). It would be better to formulate the problem 
so that only derivatives of u up to order m occur, and this is where a 
Dirichlet form comes in handy. 

Let D be a Dirichlet form for ZL on Q (it doesn’t matter which one 
at this point). We note that D(v,u) is well defined for v,u € Hm(Q). 
Moreover, if u € Hm(Q) and v € Co°(2) we have D(v, u) = (L*v | u); thus 
u is a distribution solution of Lu = f if and only if D(v, u) = (v| f) for all 
v € C%(Q). In this case we also have D(v,u) = (v| f) for all v € H2,(Q) 
by passing to limits. Thus the final version of our problem is the following: 


The Dirichlet Problem: Given f € Ho(Q), find u € H®(Q) such 
that D(v, u) = (v| f) for all v € HE(Q). 


By the local regularity theorem (6.33), we know that any solution u will 
be in H2,(Q) MN H'e¢(Q). It actually turns out that u € H?,(Q) A Ham(Q); 


more generally, if f € H,(Q) then u € H2(Q)N Hiyom(Q). 
The inhomogenous problem 


Lu= f onQ, Au =g; on S forO<j<m, 


can be reduced to the homogeneous problem, provided the g;’s are rea- 
sonably nice. Namely, since we expect the solution to be in Ham({Q) we 
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require that there exist a function g € Ham(Q) such that 63g = g; on S 
for 0 < j < m in the sense of Corollary (6.48). Then, setting w = u—g, 
we are reduced to solving 


Lw=f—LgonQ, OGw=0onS for0<j<m, 


or in other words, to finding w € H®(Q) such that D(v,w) = (v| f — Lg) 
for all v € H2(Q). 

Henceforth we shall consider only homogeneous boundary conditions, 
with the understanding that inhomogeneous problems can be handled as 
above. A much more detailed discussion of inhomogeneous problems can 
be found in Lions and Magenes [34]. 

We can set up other boundary value problems in a similar fashion. Let 
X be a closed subspace of H,,(Q) which includes H2,(Q). Any u € Am (Q) 
agrees locally inside Q with elements of X, since Cu € H2(Q) C X for any 
¢ € C&%(Q). Thus for u € H,,(Q) the condition u € X is a condition on 
the behavior of u at the boundary. Therefore, we might consider boundary 
value problems of the following form: Given f € Ho(), find u € X such 
that Lu = f on Q. In general, such a problem is underdetermined, as 
the condition u € X does not give enough boundary data. Here again, 
the formulation in terms of a Dirichlet form for ZL comes to the rescue. 
Consider: 


The (D,X) Boundary Value Problem: Let D be a Dirichlet form 
for L and let X be a closed subspace of Hm(Q) containing H®(Q). Given 
f € Ho(Q), find u € X such that D(v,u) = (v| f) for allv € X. 


We make several comments on the meaning of this problem: 


i. Since C9°(Q) C X, a solution of this problem will satisfy (L*v|u) = 
(v[f) for all v € Co°(Q) and hence will be a distribution solution of 
Lu f. 


ii. However, we require that D(v, u) = (v|f) not just for v € Cp°(Q) but 
for all v € X. This automatically imposes some additional boundary 
conditions on u — namely, the ones that ensure that the boundary 
terms coming from the integration by parts from D(v, u) to (v| f) van- 
ish. These are the so-called free (or natural or unstable) boundary 
conditions. As we shall see, the requirement that u and v both be- 
long to the same space is just what is needed to make the existence 
and uniqueness theory run smoothly, so this formulation automatically 
provides the right number of boundary conditions. 
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iii. If X #4 H°(Q), the choice of Dirichlet form matters, because it affects 
how the integration by parts proceeds. 


By choosing D and X properly, we can construct a wide variety of 
boundary value problems in this fashion. Let us look at some examples. 


(7.8) Example. 
X = H®(Q). This is the Dirichlet problem. 


(7.9) Example. 

X = H,,(Q). Here there are no a priori boundary conditions. To see 
what the free boundary conditions are, suppose that u and f are smooth 
functions on 2 satisfying D(v, u) = (v| f) for all ve C(O). By Theorem 
(7.4) there are operators N; of order j7, m <j < 2m, such that 


0 = Div,u)- (v|f) = & [tema ma) do. 


But by Proposition (7.7) we can choose v so that Av = Nom—1—jt for all 
j, so it follows that 


Nom-i-ju =0o0n S forO<j<m. 
Thus we obtain m boundary conditions. Note that these boundary condi- 
tions are of orders m,...,2m— 1, so a priori they do not make sense for 
u € Hm(Q). However, under suitable hypotheses the solution will turn out 


to be in H2,,(Q), so the boundary conditions are well defined in the sense 
of Corollary (6.48). 


(7.10) Example. 
More generally, let {M;} be a normal [0,m-— 1]-system on S, and let J be 
a subset of [0,m— 1] and J’ the complementary subset. We take 

X = closure in Hm() of {y € C°(Q): Mjv = 0 on S for j € J}. 


As above, suppose u € X and f are smooth functions on 2 satisfying 
D(v, u) = (v|f) for all v € X, and let {Nj} be the operators of Theorem 
(7.4). Then if v € C°(Q) NX, we have 


0 = D(v,u)— (1) = D> f (My0)(Namai-s) de. 
ges’ 
But by Proposition (7.7) we can find v € C(Q) such that Mju = 0 for 
j € J and Mjv = Nom-i-ju for j € J’. It follows that the boundary 


conditions on u are 


Mju=0onS(jéJ), Nom-1-ju=0onS (je J’). 
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(7.11) Example. 
Now let us be specific. Let 


L=- 2 a;j30;0; + >> 45d; +a 
j=l 


ij=l 


be a second order operator with real coefficients, where (a;;) is positive 
definite (and in particular, symmetric). Let us rewrite L as 


L= —S> b:50:9; + >> 558; + > 65; +a 
where b;;, bj, and cj are real, b;; + bj; = 2a;;, and 6b; +c; = a;. By the 
divergence theorem, 


(v| - b:;0;0; u) = (0;(b:jv) | 0; u) - | vbjju0;udo 
§ 
= (Ov | bjjO;u) + (v | (Oibij Oj u) — / vb;;,0;tde, 
Ss 


and 
(v |b; Oj;u) = —(0; (bj v) | u) +f vbjvjutde 
Ss 


= —(0;v | bju) — (v | (0;b;)u) + vb; vjide. 
s 
Thus if we define the Dirichlet form D for L by 
D(v, u) =) (Gi | bs 9ju) — D7 (jv | bu) 


3 j 


+ Dov | (Dy ibig + 5 )O;u) + (v | (a — 0, 0;65)u), 

i 

we have 
D(v, u) — (v| Lu) = Hf [do by %40;% — oe bj va do. 
ie Hag j 

Hence if we take X = H,(Q), so that v|S is arbitrary, we obtain the bound- 
ary condition 

ye bj wiOju — So byyyu =OonS. 

ij j 


If we set 


By = > bin, w= (M1,.-.,Hn), OQuu=p- Vu, B= >_ byy;, 
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this equation becomes 


0,u— Bu=O0onS. 


By choosing the quantities bj; and 6; properly, we can obtain such condi- 
tions for more or less arbitrary yp and @, provided that w-v > 0. (This 
condition is necessary since }>bjjujvj = So aijujv; > 0.) In particular, if 
we take bj; = a,;, 4 is called the conormal to S with respect to L and is 
denoted by v". (If aj; = 6;; as for the Laplacian, of course, then vy = v*.) 


If we also take 6; = --- = b, = 0, we obtain the Neumann boundary 
condition 

d,-u = 00n S. 
On the other hand, if we take b;; = a;; +¢,; where cj; = —cjj, we have p = 


v* +7 where rT = )_; ¢jj¥44. Since (cj; ) is skew-symmetric, rT is perpendicular 
to v, ie., r is tangent to S. Thus, given @ € C™(S) and any smooth 
tangential vector field 7 on S, to construct the boundary conditions 


O,-u+ 0,u— Bu=O0onS 


for the oblique derivative problem, we choose the skew-symmetric 
matrix (cj) = (bi; — aij) so that a cyu; = 1; and the vector field 
b= (b1,...,6n) so that b- vy = B. 


EXERCISES 


1. Consider the Dirichlet form D(v,u) = (Av| Au) for A? on R". What 
are the free boundary conditions for the (D,X) problem if (a) X = 
H2(Q), (b) X = H2(Q)N HP(Q)? 


2. Let Q be a bounded domain in R” and c a positive constant. Find a 


Dirichlet form D for -A on Q such that D(v,u) = D(u,v) and the 
(D, H1(Q)) problem is the problem 


Au=fonQ, @u+cu=0on dQ. 


(This problem arises in the study of heat flow where the boundary 
condition is given by Newton’s law of cooling; see §4C.) 
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D. The Coercive Estimate 


Our setup is still too general to produce nice results. For example, consider 
the Dirichlet form 


D(v, u) = 4(dsv | Oru) = (Oz + i, )u | (Oz + tdy)u) 


for ~A on R?. Let 2 be the unit disc, and Jet X = Hy({M). Consider the 
(D,X) problem with f = 0, ie., find u € H1(Q) such that D(v, u) = 0 for 
all v € Hy(Q). Taking v = u, we see that u € H;(Q) solves this problem 
precisely when Osu = 0 on 2; that is, when u is a holomorphic function 
of z= 2 -+iy in Q. Thus uniqueness and regularity at the boundary both 
fail miserably: the solutions form an infinite-dimensional space, and most 
of them are not smooth at the boundary even though f = 0 is C™ there. 
Moreover, even for those solutions that are smooth on 9 we can obtain no 
estimates on their derivatives. For example, let ua(z) = log(z +a) where 
a — 1 is real and positive and log is defined by cutting the plane along 
the negative real axis from —oo to —a. Then ug € C%(®) and ||uafloa = 
fg \ual? is a bounded function of a since the logarithmic singularity is 
square-integrable. However, O¥ug(z) = (—1)*~}(k—1)'(z+a)7* for k > 0, 
and the L? norm of this function on 2 blows up as a — 1. (On the other 
hand, if we take X to be H?(Q) instead of H,(Q), there is no problem: a 
holomorphic function on 2 that vanishes on 62 is zero.) 

Another example of this phenomenon that works in higher dimensions 
can be found in Exercise 1. 

We therefore introduce the following definition, which is designed pre- 
cisely to avoid such pathologies. A Dirichlet form D of order m on 2 is 
said to be coercive over X, where H2(Q) C X C Hm(Q), if there exist 
constants C > 0 and A > 0 such that 


(7.12) Re D(u, u) > Cllullna—Allullon (ue X). 


Dis called strictly coercive over X if we can take A = 0 in (7.12). (Notice 
that if D is a Dirichlet form for L satisfying (7.12), then 


D'(v,u) = D(v, u) + A(v | u) 


is a strictly coercive Dirichlet form for L+A.) The example discussed above 
is not coercive, as one sees by taking u = ug and letting a — 1. 

If the space X is defined by a normal system of boundary conditions as 
in (7.10), general conditions of an algebraic nature on D and the boundary 
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operators M; and Nj are known which ensure that D is coercive over X; see 
Agmon [2]. However, the study of these conditions is beyond the scope of 
this book, and we shall content ourselves with a discussion of coerciveness 
for the more specific problems (7.8) and (7.11). 

Our first result is very simple, but it covers the second order Neumann 
and oblique derivative problems discussed in (7.11). 


(7.13) Theorem. 
Let 
D(v,u) = S>(0;v | bij u) + D> (0;0 | byu) + D(v [b50ju) + (v | bu) 
tj j j 
be a strongly elliptic Dirichlet form of order 1 on 2, and suppose the 
bij ’s are real-valued. Then D is coercive over H,(Q) (and hence over any 


XC Hy(Q)). 


Proof: Set aj; = 4 (bi; +5;;). Since the };;’s are real, strong ellipticity 
means that for some Co > 0, 


Do ais Ges = Do bisGiE; > Colé|? 
for all€ € R®. Thus (a;;) is positive definite, so if € is any complex n-vector, 
Re) digi = D> ais6i€; 2 Colél?. 
Setting € = Vu, where u € H,(Q), we obtain 
Re >, bi(Aiu)(G@) > CoD ~ [9ju?, 
so an integration over 2 yields 
Re (Aju | bi50ju) > Co > [19jullo.0 = Co(llull?.a— tlll3,9)- 
Also, for some C; > 0 (independent of u) we clearly have 
(Oj u | bju)| < |ulfrallbsulloa < Crllulls allullon, 
J(u | 659; u)| < Hulls all; 9; ulloa < Cillulls allello,o, 
\(u] bu)| < Cillullo.a < Cillull:allullon- 
Therefore, setting C, = (2n + 1)Ci, we have 
Re D(u, u) > Co((lulli.a — llello,0) — Callulli allullo,a. 
But since af < (a? + 87) for alla, 8 > 0, 


Co Cc? 
Col lull: allulloa < Sulina + 5 +k 3G, lull, om 


sO 


C, 2C2 + 
Re D(u, u) > 5 lull. - Te 2 |Iull5,0- 
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Another simple and useful result is the following. 


(7.14) Theorem. 
Suppose 


Divuy= So (8%v J acpd*u) 


lel=[4[=m 


where the aag’s are constants. If D is strongly elliptic, then D is strictly 
coercive over H®,(Q). 


Proof: If ué€ H?(Q) C Hm(R"), by the Plancherel theorem and the 
definition of strong ellipticity we have 


Re D(u,u)=Re So [enPraapée* |a(e)P ae 
lal=|pl=m 


>of an?mePr lace ae 
> 0 > femmes Plater ae 


lal=m 
=C' >> |la°ulls. 
jaj=m 


The result therefore follows from Proposition (6.15). | 
This theorem can be generalized: 


(7.15) Garding’s Inequality. 
Every strongly elliptic Dirichlet form of order m on Q is coercive over 
Hy, (2). 


Remark: The inequality in question is of course (7.12). Garding’s 
inequality is a milestone in the theory of elliptic equations; it antedates, 
and is the inspiration for, the general notion of coerciveness. 


Proof: The outline of the argument is much the same as the proof 
of Theorem (6.28). Theorem (7.14) provides the first step, the case of 
constant coefficients and no lower order terms. 

Given u € H?,(Q), let us write 


Diu,u)= D> (%ulaapdu)+  S>  (A%ulaagd?u) 
jal=jej=m min([a],|a}}<m 
= Dp(u) + Do(u). 
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To begin with, since either Ja| << m or |f| < m in each term of Do(u), for 
some constant Cp > 0 we have 


(7.16) |Do(u)| < Collul!m,allullm—1.0. 


Next, we shall show that Re D,,(u) dominates ||u|]?, 9 when the support 
of u is small. For any to € 2 we can write 


Dm(u)=  D) (O%ufaap(20)0u)+ D> (8%u| (aap—aap(20))0°u). 
jal=|pl=m fel=[s[=m 
By Theorem (7.14) (and its proof), 
Re > (8%u|aep(x0)0°u) > Cillullinas 
Ja|=|Al=m 


where C\ is independent of zo. On the other hand, let us choose 6 > 0 so 
small that if |x — zo| < 6 then 


S> laaa(z) — aap(zo)| < ot 


lal=[sl=m 


Then if u is supported in Bs(zo), 


Dd.  (8%u| (aap — 4a6(20))d° u) 


fal=[al=m 


C; 
<_sup D7 [aaa(z) ~ aaa(z0)] [107 ullo,allO?ullo,a < *[Iullm.n- 


le~s0l<? jaj=[a|=m 


Hence, for u supported in a ball of radius 6 we have 
Cc 
Re Dn(u) > [hull 


Now cover { by finitely many balls of radius 6, say Bs(x1)..., Bs(xn), 
with z1,...,¢y € 2. Choose ¢; € C&°(Bs(x;)) such that ¢; > 0 and 
pied 43 > Oon 0, and set ¢ = [S00 $2)-3/?. Then G? is a partition of 
unity on 2, so 


N 
Dn(u)= > SY) (68% u] aap du) = Ai(u) + Aa(u), 


J=1 [a|=|sl=m 
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where 
N 
Au) =D SD (8%(6ju) | aap" (G54), 
J=1 fal=|[sj=m 


N 
Alu = S75 SS (Key, Ou fGjaen du) + (8% (Gu) | [C), 0°Ju)) 


7=1 |al=lpl=am 


Since ¢;u is supported in Bs(x;), we know that 


C, & 
Re A; (u) > Z Lelissallna 


But 
N N 
SriGulla 22 YS (O%Gu) ]a%(Gu)) 
1 1 lal=m 
N 
= 32 3 Gaul Gory) 
1 falsem 


N 
+55 SS (0%, 6j]u | O%(Gju)) + (G74 | [8% G]u)). 


1 |al=m 


The first sum on the right is just 


S> (ul d%u) = S> 8% Ulloa, 


lal=m jal=m 
which, for u € H9,(Q), dominates |lul]?,.q by Proposition (6.15). On the 
other hand, [8°,¢;] is a differential operator of order m— 1, so the second 
sum is dominated in absolute value by ||u||m,allullm—1,0. For the same rea- 


son, Ao({u) is dominated by |]u}|m,nl]ul]m—1,2. In short, for some constants 
C2,C3 > 0, 


Re Dm(u) > Caljullis,a — Callullm.allullm—a,o, 
and combining this with (7.16) we obtain 


Re D(u, u) > Calfull?, og — (Co + Cs)|]ullm allullm—1.n- 
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But, as in the proof of Theorem (7.13), we have 


C2 
(Co + Cs)||ullm,allullm—i.0 < = llulln.a + Callullin—1.05 


where C4 = (Co + C3)*/2C2. Also, since || - [k,n is equivalent to |] - ||, on 
H®(Q) for k < m, by Lemma (6.11) there is a constant Cs > 0 such that 


C2 
Callullm—a.o SF llullina + Csllullé,o- 
Finally, combining these estimates yields 
Cry 2 2 
Re D(u, u) > [lullma — Csllullo.a, 
and the proof is complete. 1 
Garding’s inequality also has a converse: 


(7.17) Theorem. 
If the Dirichlet form D of order m is coercive over H°,(Q), then D is 
strongly elliptic. 


Proof: Given ¢ € C%(Q), € 4 0 € R*, and r > 0, let u(z) = 
d(x)e'€*, Then 0%u(x) = (ir€)*u(x) modulo terms of order less than Ja| 
in T, so 


D(u, u) = rm bs (€%u | Gaal u) eee 
lel=[6l=m 


=r get? | aag(2)16(2)[? de ++ 


lo|=|4|=m 
where the dots indicate terms of lower order in r. Likewise, 
lula = Dd) WO%ullan = 7?" So 62 ft |6(x)?dzt+---, 
la|<m jal=m 


and |lullon = [l¢llo2. Hence, if we apply the coercive estimate (7.12), 
divide by r?™, and let 7 — 00, we obtain 


Re D> ett? fanp(z)lo(e)Pdz > oY ef Ipe)PPae 
(7.18) lal={sl=m jaf=m 
> cre  1o(a)Pae. 
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Now, given zp € %, choose » € C& with f |p|? = 1 and take ¢(x) = 
e—"/245(e-! (x — xo)) where € > 0 is small enough so that ¢ is supported in 
©. Then as ¢ — 0, |]? approaches the delta-function at ro (cf. the proof 
of Theorem (0.13)), so in the limit, (7.18) becomes 


Re SY) €°*agp(z0) > CEL. 
lo|=|sl=m 
This inequality holds for all zo € 9, hence for all zo € % by continuity; 
that is, D is strongly elliptic on . 4 


We note that if D is coercive over X > H®(Q), then a fortiori D is 
coercive over H®.(Q). Thus strong ellipticity is always a necessary condition 
for coerciveness. 


EXERCISES 

1. Show that the Dirichlet form D(v, u) = (Av| Au) for A? on R® is not 
coercive over H2(2) for any 2 C R". (The idea is similar to the example 
in the text.) 


2. Explain why the proof of Theorem (7.13) breaks down for forms of 
higher order or for forms whose coefficients b;; are not all real. 


E. Existence, Uniqueness, and Eigenvalues 


We now confront the questions of existence and uniqueness of solutions for 
the (D, X) boundary value problem, where D is assumed to be coercive 
over X. We have set up the problem in such a way that the answers to 
these questions are obtained very easily. We need only one more tool. 


(7.19) The Lax-Milgram Lemma. 

Let H be a Hilbert space with inner product (-|-) and norm |} - |], and let 
D:HxH —C be a sesquilinear form on H (not necessarily Hermitian 
symmetric). Suppose there are constants C,,C2 > 0 such that 


[D(v,u)] < Cilloll ull, |D(u, u)| > Cal}ul)? 


for all u,v € H. Then there exist invertible bounded operators ® and V 
on K such that for all v,»w € H, 


(v|w) = D(v, dw) = D(Vw, v). 


ENiptic Boundary Value Problems 249 


Proof: Given u € K, the map v + D(v,u) is a bounded linear func- 
tional on H, so there is a unique Ru € KH such that (v| Ru) = D(v, u) for 
all v. We have ||Rul| < C,|]ul], so R is a bounded linear operator on H; 
moreover, 


[Rul] |lull > (u| Ru)| = |D(u, u)| > Cailull?, 


so ||Rul| > C2|lu||. Therefore R is injective and the range of RF is closed, for 
the convergence of {Ru;} implies the convergence of {u;}. On the other 
hand, the range of R is dense, for if u € KH is orthogonal to it we have 
D(u,u) = (u| Ru) = 0 and hence u = 0. Hence R is invertible, so if we 
set ® = R-! we have (v|w) = D(v,w) for all v,w € HK, and ||@w|| < 
Cy}|\w||. Similarly, if we define the operator B by (v| Bu) = D(u, v), then 
B is invertible and we can take ¥ = Bo}. ' 


Our first main result is the following: 


(7.20) Theorem. 

Let X be a closed subspace of Hm(Q) that contains H2,(Q), and let D be 
a Dirichlet form of order m that is strictly coercive over X. There is a 
bounded injective operator A : Ho(Q) — X that solves the (D,X) problem; 
that is, D(v, Af) = (v| f) for allv EX and f € Ho(Q). 


Proof: X is a Hilbert space with norm ||- ||m,2, and D satisfies the 
hypotheses of the Lax-Milgram lemma on X; hence there is a bounded 
linear operator ® on X such that (v|w)mo = D(v, Ow) for all v,w € X. 
On the other hand, if f € Ho(Q), the map v > (v| f) (ZL? inner product) 
is a bounded linear functional on X: 


Kel AL s IFllo,allello.a < |lfllo,allullm,a. 


Hence there is a unique Rf € X such that (v| Rf)m.n = (v|f) for allv € X, 
and ||Rf||m,a < {lfllo.a. & is thus a bounded linear map from Ho() to X, 
and the desired solution operator is A = @o R. i] 


We can also consider the adjoint Dirichlet form 
D*(v,u)= D(u,v) (u,v € Hm (Q)). 
Clearly if D is a Dirichlet form for Z then D* is a Dirichlet form for the 


formal adjoint L*, and the (D*, X) problem is a boundary value problem for 
L* which is called the adjoint of the (D,X) problem. The (D,X) problem 
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is called self-adjoint if D = D*. This implies that L = L* and also that 
the free boundary conditions for the two problems are the same. 

In any event, if D is strictly coercive over X then so is D*, so Theorem 
(7.20) yields linear maps A and B from Ho({) to X that solve the (D, X) 
and (D*,X) problems. Let us denote by T and S the compositions of A 
and B, respectively, with the inclusion map X — Ho(Q). Thus T and S 
are operators on Ho({{). There are two crucial observations to be made: 
(i) T and S are compact. This follows from Rellich’s theorem (6.46). 

(ii) S = T*. Indeed, for any f,g € Ho({Q) we have 


D(Bg, Af) = (Bg | f) = (S| f), 


and on the other hand, 
D(Bg, Af) = D*(Af, Bg) = (Af | 9) = (TF 9) = (9 ITS), 


so that (Sg | f) = (g|TS). 
With this, we are ready to handle the general case. 


(7.21) Theorem, 
Let X be a closed subspace of Hm() that contains H®(9), and let D be 
a Dirichlet form of order m that is coercive over X. Define 

V={ueX: D(v,u) =0 for all ve X}, 

We {uEX: D(u,v) =0 for allv € X}. 
Then dimV = dimW < oo. Moreover, if f € Ho(Q), there exists u € X 
such that D(v,u) = (v| f) for all v € X if and only if f is orthogonal to W 


in Ho(Q), in which case the solution u is unique modulo V. In particular, 
if V = ‘W = {0} the solution always exists and is unique. 


Proof: By assumption, 
[D(u,u)| > Cllulina—Allullan (ue X), 


where we can assume A > 0 since the case A = 0 is covered by Theorem 
(7.20). Let 
D'(v,u) = Div, u) + A(v| u). 


Then D?’ is strictly coercive over X, so there is a compact operator T on 
Ho(Q) whose range is in X such that D'(v, Tf) = (v| f) for all vy € X and 
all f € Ho(Q). Now, 


Div,u)=(v[ fy) <=> D'(v, u) = (v| f) + A(vfu) = (v| f + Au). 
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For fixed f and u, the latter equation holds for all v € X if and only if 
T(f + Au) = u; thus, 


D(v,u) = (v[f) forallue X <> Amtu—Tu=A7'TyfF. 


In particular, taking f = 0 we see that V = {u € Ho() : Tu = A~"u} (the 
condition Tu = A~!u automatically implies u € X). Likewise, in view of 
the remarks preceding the theorem, W = {u € Ho(Q) : T*u = A~!u}. The 
first assertion therefore follows from Theorem (0.38). Moreover, if w € W, 
for any f we have (Tf |w) = (f |T*w) =A7!(f| w), so f £ W if and only 
if Tf LW. The second assertion therefore follows from Corollary (0.42). § 


In case D is self-adjoint, we can say more. 


(7.22) Theorem. 

Suppose D is coercive over X and D = D*. There exists an orthonormal 
basis {uj} of Ho(Q) consisting of eigenfunctions for the (D,X) problem; 
that is, for each j we have u; € X and there is a real constant 1; such that 
D(v, uj) = wj(v]uj) for all v € X. Moreover, 4; > —X for all j where X is 
the constant in (7.12), limjoo py = +00, and uj € C(Q) for all j. 


Proof: Let T be the solution operator for D’ as in the proof of The- 
orem (7.21) (we no longer assume A > 0). T is compact and self-adjoint; 
it is also injective, being the composition of an invertible map from Ho(2) 
to X with the inclusion map from X to Ho(Q). Hence, by the spectral the- 
orem (0.44) there is an orthonormal basis {u;} for Ho({) and a sequence 
of nonzero real numbers {a;} with limjioo aj = 0 such that Tu; = aju;. 
(In particular, uj = az Tu; € X for all 7.) Since 


ally; |}? = (Tu; | uj) = D'(Tu;,Tuj) = Re D'(Tu;,Tu;) > 0, 


we have a; > 0 for all 7. Thus, if we set pj; = a>} — ), we have by > -A, 
limjoo fj = +00, and D(v, uj) = pj(v[uj) for all v € X. Finally, if 
L is the elliptic operator associated to D, u; is a distribution solution of 
(L — pj )uj = 0 on Q, so uj E C™(Q) by Corollary (6.34). i] 


Now let us interpret this theorem in some specific instances. First, the 
Dirichlet problem. 
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(7.23) Theorem. 

Suppose that L is a strongly elliptic operator of order 2m on {} that satisfies 
L = L* and (7.2). There is an orthonormal basis {u;} for Ho(Q) consist- 
ing of eigenfunctions for L which are C® on Q and satisfy the Dirichlet 
conditions 9,u; = 0 on 09 for 0 <i < m. The eigenvalues are real and ac- 
cumulate only at +oo. If L has a self-adjoint Dirichlet form that is strictly 
coercive over H°(Q), the eigenvalues are all positive. 


Proof: Let D be a Dirichlet form for L. Then D* is a Dirichlet form 
for L* = L. Since for the Dirichlet problem the choice of Dirichlet form is at 
our disposal, we can use the self-adjoint form 3(D + D*), which is coercive 
over H®,(Q) by Garding’s inequality. All the assertions then follow from 
Theorem (7.22) except the claim that uj; extends smoothly to the boundary. 
We shall prove this in §7F for the case m = 1; for the general case, see the 
references in §7G. 1 


(7.24) Corollary. 

There is an orthonormal basis for Ho(Q) consisting of eigenfunctions for 
the Laplacian such that u; € C%(Q) and uj = 0 on 80 for all j. The 
eigenvalues are all negative. 


Proof: The Dirichlet form D(v, u) = 0} (0;v | ;u) for —A is strictly 
coercive over H?(Q) by Theorem (7.14). ' 


Next, let 
L=- So 455 0:9; + So 459; +a 
i] i 


be a second order elliptic operator with real coefficients, where (a;;) is 
positive definite. A simple calculation shows that 


Le =— > 4:5 8,9; — 250 (d:a4;)9; ->> a; 9; — $5 (adja5j)- 928; a;)+a. 
sy iy i tj j 
Hence L = L* if and only if 
aj =— > A;(ai5) (j =1,...,n). 
In this case we have 


L=L*=- > 45018; - S 2 (G:445)3; +a 
ij ij 


7.26) 
( = — 5 4:(ai59;) + 4, 
ij 
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so L has the self-adjoint Dirichlet form 


(7.27) D(v,u) = (8:0 | aij9;u) + (v au). 
ij 


According to the discussion in (7.11), if we take X = Hi(Q), the (D,X) 
problem for this choice of D is the Neumann problem for L. Hence: 


(7.28) Theorem. 

Let L be given by (7.26), where the coefficients are real and (a;;) is sym- 
metric positive definite. There is an orthonormal basis {u;} for Ho(Q) 
consisting of eigenfunctions for L which are C® on Q and satisfy the Neu- 
mann condition 0,-u; = 0 on OQ. The eigenvalues are real and bounded 
below and accumulate only at +oo. If a > 0 (resp. a > 0), the eigenvalues 
are all nonnegative (resp. positive). 


Proof: The Dirichlet form (7.27) is coercive over H1(Q) by Theorem 
(7.13), so the existence of the eigenbasis and the first statement about the 
eigenvalues follow from Theorem (7.22). To prove the last statement, we 
re-examine the proof of Theorem (7.13). Since v aii; > Clé|? for all 
€&€C where C’ > 0, we have 


D(u,u) > C D> [0ull3,q + (ul au) > (ul au). 


Hence if uj has eigenvalue y;, 
2 
4 = wyllugll” = (uj | Luj) = D(u,, uj) 2 (uj |auy). 
If a > 0 the last expression is nonnegative, and if a > 0 it is positive. | 


For the Laplacian, in particular, the eigenvalues are all nonpositive. In 
fact, the only eigenfunctions with eigenvalue zero are the locally constant 
functions, for D(u, u) = > []0;ull§,.q > 0 unless Vu vanishes identically. 


F. Regularity at the Boundary: 


the Second Order Case 


Let L be a strongly elliptic operator of order 2m on Q. If f € Hy(Q) and 
u is a solution of Lu = f, the local regularity theorem (6.33) guarantees 


254 Chapter 7 


that u € 11195, ,(Q). If in addition u satisfies certain kinds of boundary 
conditions, it will turn out that u is actually in Hom4x(Q). In this section 
we shall assume that m = 1 and prove this assertion for u a solution of the 
(D,X) problem, where X is either H?(Q) or Hy(Q) and Dis a Dirichlet form 
for L that is coercive over X. This setup includes the Dirichlet problem 
for second order strongly elliptic operators and the Neumann and oblique 
derivative problems for second order elliptic operators with real coefficients. 
We shall make some comments on the higher order case, with references to 
the literature, in the next section. 

Most of the labor of the proof will be performed in small open sets near 
the boundary S = 02 which look like half-balls. Specifically, let V be an 
open set intersecting S such that VNQ can be represented as 


VOQ= {rEeV:a; < O(r1,..., 21-1, Ti41,---,2n)} 
for some i, where ¢ is a C™ function. Define new coordinates on V by 
Qj for j <1, 
Ui = ¢ Tj41 fori<j<n, 
rj —¢(r},...) forj=n. 


Then V NQ is represented in the y coordinates by the condition y, < 0. 
By a translation we may assume that y = 0 lies on VS, and we fix r > 0 
so small that the set where |y| <r lies in V. For any p < r, we then set 


N(o) = {u: lul <p and yn <0} = 9NB,(0), 


as in (6.42). 

Now let D be a Dirichlet form for the second order operator L on 22. 
Since | det(Oy; /Ozx,)| = 1, Lebesgue measure is preserved by the change of 
coordinates, and hence so are L? inner products. We assume that in the y 
coordinates D and L have the form 


D(v, u) = be (0% u|agg0%u), L= » a,0°, 
tal, I< laj<2 
where 0% = (0/dy)*. 
Also, let X be either H?(Q) or Hy(Q). We set 
X[r] = {ue X:u=0 on Q\ N(p) for some p <r}. 


The space X{r] has the following two crucial properties (valid for either 
X = H?(Q) or X = Hy(M)): 
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i. If C € C?(Q) vanishes on Q\ N(p) for some p < r, then Cu € X[r] for 
any u € X. 

ii. If u € X[r] vanishes on 2 \ N(p), then for any h € R with [h| < r—p 
and any j <n, the function 


Une; (y) = u(yi,- 2 Yj-15 95 +h, yj41, oe Yn) 


belongs to X and vanishes on 2\ N(p+A), hence belongs to X[r]. There- 
fore the difference quotients Afu, as defined before Theorem (6.19), 
belong to X[r] provided |h| < r—p and a, = 0. 
In this setup, then, we have the following local regularity theorem at the 
boundary. 


(7.29) Theorem. 

Suppose D is coercive over X, f € Hy(N(r)) for some k > 0, u € X, and 
D(v, u) = (v|f) for all v € X[r]. Then for any p <r, u € Higo(N(p)) and 
there is a constant C > 0, depending only on p and k, such that 


lulleza,v¢oy S C(I Fleer) + (lullaacry)- 


Proof: The proof is accomplished in three steps, of which the first is 
the most substantial. 


Assertion 1. Ifp<r,j < k+1, and 7 is a multi-index with |y| = j and 
‘Yn = 0, then 07%u € Hi(N(p)) and there is a constant C > 0, depending 
only on p and j, such that 


O7ullawco) S CCS llewery + Ulullavery)- 


Proof: By induction on j, the case j = 0 being trivial. Suppose the 
assertion is true with j replaced by 0,...,j3 —1. Set t = (29+ r)/3 and 
s = (p+ 2r)/3, s0 p< t<s <r. Then by inductive hypothesis, with p 
replaced by s, we have 05u € Hi(N(s)) for |6] < j — 1 and 6, = 0, and 


(7.30) O° ulls,wes) < C(I Fleer) + Ulellaav(ry)s 


where C is the largest of the constants obtained in the proof for 0,...,7—1 
with p replaced by s = (p+ 2r)/3, so that C depends only on j and p. 
Fix ¢ € C°°(G) with ¢ = 0 on Q\ N(t) and ¢ = 1 on N(p). If 7 is a 
multi-index with |y| = j and yn = 0, we wish to consider the difference 
quotients A? (Cu) for [h| < s —t, which belong to X[s]. Choosing some 
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index i < n with 7, # 0, we factor one difference operator in the i direction 
out of AY and write AY = Ai AY. 


We claim that for any v € X{s], 


(7.31) |D(v, Af ¢u)| < Cllullawesy (i fllewer + Mullion), 


where C’ depends only on p and j (and ¢, which is fixed). (Here and in 
what follows, differential and difference operators act on everything to the 
right of them unless separated by parentheses; e.g., Aj(Cu means Aj (¢u).) 
To prove (7.31), we shall commute various operators and move various 
quantities from one side of the inner product to the other, obtaining 


D(v, AZCu) = > (8%0 | apd" AZ Cu) 
= So (a%v | AYaug0°Cu) + Ey 
= Soar | AvaapCO®u) + Ey + Ep 
=(-1) S2CA7, 0% | aap 0° u) +E, + E, 
= (-1)) S0(0%CA% 0 | aap0°u) + Ey + Ea + Bs 
= (—1)) D(CA7,0, u) + Ei + Eo + Es 


= (-1) (CA%,0| f) + Lit E2+ Es 
= Ey + E2+E3+E4, 


where 
Ey = S7(0"» | foap, AN1O%u), 


By = D> (0%v| Afaap (0% ¢)u), 
(al=1 
E3= (-1)+} > ((O%Q)AX,Y | agp0"u), 
jaj=1 
E,=—(Ai,v ALCS). 


We have used the facts that AY commutes with 0% and 0°, that the adjoint 
of AY is (—1)'"A7,, and — in the formulas for Ez and E3 — that |a| <1 
and [| < 1. 

We claim that the terms £y,..., £4 all satisfy the estimate (7.31), and 
to prove this we make use of Theorem (6.51), Proposition (6.52), and the 
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product rule for derivatives: 


Ex] < C'llullivesy D> 18*O?Culo.wesy 
5<y7, i6]<1 


< C'llulli wey >, (O° Culltweey 


b<y 


SCelliney > WO ullswey, 


[8]<j—1, 5,=0 
[Ea] < C'ulh,wey D> 07% aap(8%C)ullo,.n(s) 
a,p 
SC"olhivoy D2 O° ullowcsy 


l81S5, 6n=0 


SC") DS * ull vee), 
18|<j—1, 6,=0 


|E3| = tv |AZ(0%C)aap 0° u) 


< Phe D219” (8%6)aap9° ullo,w¢s) 
ap 
<C"lolwey D> NO* allay. 
161<j-1, 6n=0 
The desired estimate for E,, E2, and E3 then follows from (7.30). Finally, 


|Eal < C'lulls,wesyllO” Cf lias) S C’llolla well fll, wer) 


since s <r and |7'| = 7 —1< k. Thus (7.31) is established. 
Now since Aj'Cu € X[s] we can set v = Aju in (7.31) and apply the 
coercive estimate: 


WARCullt nce) S Co(ID(ARCu, AnCu)| + ARCullo,wey) 

<S CiALCulls,coy (I Flleswery + Heth ery + ARCullo,w¢sy)- 
But, writing AY = Ai Ay, and applying (7.30) and Theorem (6.51), we 
have 

VARCullo,esy SO” Culla,v(sy S CU Flle.wery + Hlulli.vee): 

Therefore, 

ARC ull wes) S Ca(IFllewery + llelh very), 
where C, depends only on p and j. Assertion 1 then follows from Theorem 


(6.51) together with the observation that [|@7 ul]; wp) < [|O7Culli,w(s) since 
¢€=1on N(p). 
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Assertion 2. If p<r,j <k+2, and 7 is a multi-index with [y| < k+2 
and 7, = j, then 07u € Ho(N(p)) and there is a constant C > 0, depending 
only on j and p, such that 


O7ullowery < CC Flle ery + llellawery)- 


Proof: By induction on j. Assertion 1 shows that Assertion 2 is true 
for j = 0,1. If j > 2, set 


7 ae (V1 6y Y=) In 4 2), 


so |y'| = j—2< k. Since L = }>a,0% is elliptic, the coefficient @ = 
ao,...,0,2) of 6? is never zero, so the equation Lu = f means that 


Ou =a1(s- > a0"). 
On<2 
Thus 
Ou = 0" uz ar [a G - 


S a9"u)]. 


On<2 


On the right, after performing the differentiation a7’ we have a sum of 
smooth functions times derivatives of f of order < k, plus a sum of smooth 
functions times derivatives 0° u with |@| < k+2 and 8, <j—1. Taking L? 
norms on N(p), the first sum is dominated by ||fl|k,w(-), and the second 
sum is dominated by {| fIlk, vcr) + {lulla,wcry by inductive hypothesis. 


Now we can complete the proof of Theorem (7.29). Let C be the largest 
of the constants in Assertion 2 as 7 ranges from 0 to k + 2. Then all 
derivatives O7u with |y| << &+2 are in Ho(N(p)) and satisfy the estimate 


OT ullo,w(oy S C(I Fle, very + [ellavery)- 
Therefore u € Wi4o2(N(p)) and 
lullese.n(o) S CU lle.wery + luli very): 


Since N(p) clearly has the segment property, the theorem follows from 
Theorem (6.39). | 
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If we transform back to the original coordinates in which OQ need not be 
flat, we see that we have established the following local regularity results 
for solutions of the (D,X) problem. Suppose V is an open subset of Q; 
suppose u € X and f € Ho(Q) satisfy D(v,u) = (v| f) for all v € X; and 
suppose f € H;,(V). Then: 


i. If W is an open set with W C V, then u € Hey2(W). 


ii. If 29 € OQ has a neighborhood W such that WNQ C V, then there is 
a neighborhood W’ C W of zo such that u € Hy42(W'NQ). 


(i) follows from Theorem (6.33), and (ii) follows from Theorem (7.29). 
We now combine these results to obtain the global regularity theorem for 
solutions of the (D,X) problem with an optimal estimate. 


(7.32) Theorem. 

Suppose that X is either H?(Q) or H;(Q) and that D is a Dirichlet form 
of order 1 for the operator L which is coercive over X. Suppose further 
that u € X and f € Ho(Q) satisfy D(v,u) = (v|f) for allv € X. If 
f € Hi(Q) (k = 0,1,2,...), then u € Hy42(Q) and there is a constant 
C > 0, independent of u and f, such that 


{lulles2,a < C(I flle.n + [lullo,a)- 


Proof: We begin by taking a covering of OQ by open sets V;,..., Vir 
such that each V,NQ can be mapped to a half-ball N(r) as in the discussion 
preceding Theorem (7.29), We can then find an open covering Wo,..., Wu 
of 91 such that Wy C Q and W; C V; for j > 1. 

We know that u € Hz42(Wo) by Theorem (6.33). To obtain an estimate 
for |lulle+2,wo, let Uo = Q and Uz41 = Wo, and interpolate a sequence of 
open sets U,,...,U;, such that U; 5 Oja1 for 0 < j < k. For each such 
j, choose ¢; € C3°(U;) with ¢; = 1 on Uj4i. By Corollary (6.31) (with 
t=0), 

llulli+2c sa. S MGulls42 < Ci (Ls ull; + IG ullo) 
S Cy (Gj Lully + LL, Gulls + lg ullo) 
S Ca({[Lullj,o + llulljsi.0; + lullo,a), 


since [L, ¢;] is a first order operator with coefficients supported in U;. Com- 
bining these estimates for 0 < j < k with the equation Lu = f yields 


llulles2,wo < Ca(Ilflle.n + llullia)- 
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On the other hand, when V; NQ is mapped to a half-ball N(r), Wj NQ 


will be mapped into a half-ball N(p) with p < r. Hence, by Theorem (7.29) 
we have u € Hxyo(W; AQ) for j > 1, and 


llulleg2,wjnan < Ca(Ilflle,vjnn + llullivjan) < Ca({L fille, + lulls). 
Since 2 = Wo U UM (WwW; AQ), therefore, u € Hy42(Q) and 
iJullez2.a < Cs(l{Flle.n + [lullia)- 


It remains to obtain the sharper estimate with |jull1,q replaced by 
\ullo,a. This follows from the coercive estimate: 


Wull}.n S C(LD(u, u)| + [lello,n) < Cu | FY] + llello,) 
< Cllullo,a(llFllo.a + [lullo,a) < Chlullra(Ilfllo,a + lullo,n), 


so that 
llullaa < C([lFlloa + llullo,a) < C(I lle.n + Ilullo,a)- 


The proof is complete. i] 


(7.33) Corollary. 
If f € C°(Q) then u € C%(Q). 


Proof: Use the Sobolev lemma (6.45). 1 


(7.34) Corollary. 
If u € X is an eigenfunction for D, i.e., if for some constant » we have 
D(v,u) = A(v | u) for allvu € X, then uE CQ). 


Proof: Apply Corollary (7.33), with f = 0, to the Dirichlet form 
D'(v,u) = D(v,u) — A(v | u) for L—A. i] 


EXERCISES 


1. Show that the Dirichlet form D(v,u) = (Av|Au) is coercive over 
H2(Q) 9 H9(Q), for any Q. (Cf. Exercise 1 in §7C and Exercise 1 
in §7D.) 
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2. Let L be a formally self-adjoint strongly elliptic operator of order 2 on 
Q, let {u;} be the eigenbasis for the Dirichlet problem as in Theorem 
(7.23), and let {4;} be the corresponding eigenvalues. 

a. Suppose f = So ejuj; € Ho({) and c; = 0 whenever yp; = 0. Show 
that the solution of the Dirichlet problem Lu = f, u € H?(Q), is 
u=y> Byte; uj. 

b. Suppose u € H2(Q) and f = Lu € Ho(Q) have the expansions u = 
¥Db;u; and f = Se;u;. Show that the following are equivalent: 
(i) u © H9(Q), (ii) ej = wyby, (iit) To [ajbyl? < 00. 

c. Suppose u = )>b;uj € Ho(Q). Show that the series )>b;u; con- 
verges in the norm of Ho,(Q) (k > 1) if and only if u € Ho,(Q) and 
Liu € H9(Q) for0<j<k. 


G. Further Results and Techniques 


The regularity theorem (7.32) is true for much more general boundary 
value problems for elliptic operators of any even order 2m if one replaces 
the subscript k +2 by k +2m. For example, it is true for the Dirichlet 
problem for an arbitrary strongly elliptic operator. Proofs of this fact, in 
the same spirit as our proof of Theorem (7.32), can be found in Agmon 
{3], Friedman [21], and Bers, John, and Schechter [7]. The first two of 
these deal explicitly with the more general case where the coefficients of 
the operator are not assumed to be C®™, in which case one only obtains 
regularity up to the order of smoothness of the coefficients; however, this 
extension is straightforward and involves no essentially new ideas. Agmon 
[3] also contains some interesting material on the spectral theory of elliptic 
operators. 

More generally, suppose L = )- a, 0° is an elliptic operator (with C™ 
coefficients) of order 2m on. Let J be a subset of [0, 2m—1] of cardinality 
m, and let {B;};e7 be a normal J-system on S = 09. Consider the 
boundary value problem 


(7.35) Lu=f onQ, Byu=0 on S for j € J. 


To describe the regularity theorem for (7.35) we need to introduce three 
definitions: 


i. Lis properly elliptic if for every z € 2 and every pair 1, € of linearly 
independent vectors in R", the polynomial P(z) = yz(z, 1+z&2) has m 
roots with positive imaginary part and m roots with negative imaginary 
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part. (Recall that x2(z,€) = Yjajsom a(2)é% is the characteristic 
form of L as defined in §1A.) Every strongly elliptic operator is properly 
elliptic, and the proof of Proposition (7.1) shows that every elliptic 
operator on 9) C R” is properly elliptic if n > 3. 


ii. Let L be properly elliptic. The operators B; cover L (or satisfy the 
complementing condition or the Lopatinski-Shapiro condition) 
if the following condition holds. Let x be any point on S, v the normal 
to S at x, and € any tangent vector to S at x. Let z,(2,&),...,2m(2,€) 
be the roots of the polynomial P(z) = yx(z, € + zv) with positive 
imaginary part, let Py(z) = []{'(z—2;(z, €)), and let [(P,) denote the 
ideal in the polynomial ring C[z] generated by P,. Then the polyno- 
mials Q;(z) = xa; (z, € + zv) (j € J) are linearly independent modulo 
1(P4). 


iii. Let us call the problem (7.35) regular if there are finite dimensional 
subspaces V and W of C®() and a sequence of positive constants 
Co, C,... with the following properties. Let V+ denote the orthogonal 
complement of V in Ho(Q). Then for any f € V4 M H,(M) there is a 
solution u € Hy4om(Q) of (7.35); u is unique modulo W and satisfies 


llulle+om.a < Ce ((lflle.a + [lullo,n)- 


We then have the following theorem, due to Agmon, Douglis, and Niren- 
berg: 


(7.36) Theorem. ; 
Suppose L is properly elliptic. The problem (7.35) is regular if and only if 
the operators {Bj;}jey cover L. 


An exposition of this result along the lines of the original arguments, in- 
cluding a detailed discussion of the generalization to inhomogeneous bound- 
ary conditions, can be found in Lions and Magenes [34]. See also Mi- 
randa [37] for a historical discussion. However, the best proofs of Theorem 
(7.36) now available use the technology of pseudodifferential operators; see 
Hormander (27, vol. III], Taylor [48], or Treves [53, vol. I]. 

It follows from results of Agmon [2] that if the problem (7.35) arises as 
a (D,X) problem as in (7.10), the coerciveness of D over X implies that 
the operators B; cover L, so that Theorem (7.36) can be applied. Theorem 
(7.36) is much more general, however. It should also be mentioned that 
not all interesting (D,X) problems are coercive. The most important and 
intensively studied non-coercive problem is the 9-Neumann problem that 
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arises in complex analysis in several variables; see Folland and Kohn [18] 
for the basic theory and Greiner and Stein [23] for more refined results; 
also Stein [46, §XIII.6}. 

Although L? methods tend to be relatively simple and elegant, they are 
not the alpha and omega of elliptic regularity theory. For example, they 
do not yield the solution u € C(Q) of the Dirichlet problem 


Au=0o0nQ, u=fonS (f €C(S)) 


which we obtained in Chapter 3. For another example, suppose we wish to 
solve Au = f where f is (say) C* with compact support. Theorem (6.33) 
guarantees that every solution u is in Ay, and hence (by the Sobolev 
lemma) in CY for any j < k+2—4n. But Theorem (2.28) shows that u 
is in C*+1+4 for any # < 1 and is in C*+2+% (0 < @ < 1) provided that 
f € C+. In general this is a considerable improvement over the result 
obtained by L? methods. 

To get the best possible results, one has to use a variety of function 
spaces, the most important of which are the L? Sobolev spaces L? that 
were introduced at the end of §6A and the Holder or Lipschitz spaces 
ctt*, 0 <a <1. (See Stein [45], Adams [1], and Nirenberg [38] for 
discussions of these spaces and their relationships.) As we mentioned at 
the end of §6C, an analogue of the local regularity theorem (6.33) holds 
for these spaces, and the same is true of Theorem (7.36). That is, if the 
problem (7.35) is regular, then u € L7,.,,(2) whenever f € L{(2), and 
u € Ck+2m+0(0) whenever f € C*+%(Q). These facts can be established 
by using the solution of (7.35) via pseudodiflerential operators together 
with the L? and Holder estimates for these operators; see Taylor [48]. 


H. Epilogue: the Return of the Green’s Function 


At last we can tie up the loose ends in §2E. The result we need is the 
following: 


(7.37) Proposition. 
Let 2 be a bounded domain with C© boundary S, and suppose f € C™(®). 
Then the solution u of the Dirichlet problem 


Au=0o0nQ, u=fonS 


(which exists and is unique by Theorem (3.40)) is in C°°(Q). 
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Proof: Let v be the solution of 
Av = Af on Q, v=OonS. 


uv exists and is unique by Theorem (7.20), since the Dirichlet problem for 
—A is strictly coercive by Theorem (7.14). Moreover, Corollary (7.33) 
shows that v € C~(Q). But A(f — v) =0 on 2 and f —v = f on S, so by 
the uniqueness theorem (2.15), u= f —v € C™(Q). i] 


Now we can construct the Green’s function on 2. Let N be the fun- 
damental solution for A given by (2.18), and for each z € 2 let uz be the 
solution of the Dirichlet problem 


Au, =00nQ, uz(y) = N(z—y) for ye S. 


uz exists, is unique, and is C@ on 2 by Proposition (7.37), since N(z—-) 
agrees on S with a function f, € C(Q). (For example, take f,(y) = 
(1 — o2(y)|N(@ — y) where ¢; € C&°(Q) and ¢, = 1 on a neighborhood of 
z.) Then the Green’s function is 


G(z,y) = N(z — y) — uc(y). 


G(z,:) is thus C® on © \ {x}, so Claim (2.35) is established. 
We now restate and prove Claim (2.38): 


(7.38) Proposition. 
Let g be a continuous function on S, and define w on Q by 


w(z) = if a(v)0.,G(2, v) do(y). 


Then w extends continuously to 2 and solves the Dirichlet problem 
Aw = 0 on Q, wa=gonS. 


Proof: The solution to this Dirichlet problem exists and is unique 
by Theorem (3.40); call it u. We need only show that u = won 2. By 
the Weierstrass approximation theorem (4.9), we can find a sequence of 
polynomials {g;} that converges uniformly to g on S. For each j let u; be 
the solution of the Dirichlet problem 


Au; = 0 on Q, uj = gj on S. 
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By Proposition (7.37), uj € C%°(Q). The remarks in §2E preceding Claim 
(2.38) then show that for all  € 2, 


uj(2) = [ uj(v).,G(2,9) do(w) = f 95(0)8, (2,9) do(y). 
Now let 7 — oo. On the one hand, it is clear that for each zr E22, 
[as(2.,6(2,n doy) — f 9(w8.,6(2,2) dety) = w(2). 


On the other hand, the maximum principle (2.13) implies that uj; — u 
uniformly on {. Thus u = w, and we are done. | 


Chapter 8 
PSEUDODIFFERENTIAL OPERATORS 


The theory of pseudodifferential operators was initiated around 1964 by 
Kohn and Nirenberg, although it has roots in earlier work in the theory 
of singular integrals and Fourier analysis, and it was subsequently refined 
and extended by a number of other authors, notably Hormander. It has 
become one of the most essential tools in the modern theory of differen- 
tial equations, as it offers a powerful and flexible way of applying Fourier 
techniques to the study of variable-coefficient operators and singularities of 
distributions. In this chapter we shall present the elements of this theory 
together with a few applications. More comprehensive accounts, includ- 
ing many mote applications, can be found in Taylor [48], Treves [53], and 
Hormander (27, vol. III]; see also Saint Raymond [42] for a detailed ele- 
mentary treatment and Stein [46] for related recent developments. 


A. Basic Definitions and Properties 


Throughout this chapter we shall adopt the notational convention for deri- 
vatives introduced in §6D, namely 
1 


1 
—= i: . ;— : as _ 
D Oni 79, ie., D; Oni 39 and D 


1 or 
(irae 


The convenience in this lies in the Fourier transform formula (D®u) (€) = 
€°u(€). Any linear differential operator with C™ coefficients on an open 
set 2 C R® can then be written in the form 


L= oy ay(r)D* (a, €C™(M)), 


lal<k 
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and the function 
o1(2,€)= >> aa(z)é* 
lal<k 
is called the symbol of L. It can be used to represent the operator L as a 
Fourier integral; namely, on applying the Fourier inversion theorem to the 


formula (Du) (€) = €%(€), for u € C2°(Q) we obtain 


Lu(z) = Waals) fer *erare) dé = [et *Sou(e,8 a6) dé. 


To rephrase this: Differential operators with C® coefficients on 2 are the 
operators of the form 


(8.1) Lu(z) = ef =k p(x EUE)dE (we CR(M)), 


where p(z,£) is a polynomial in € with coefficients that are C™ functions 
of rE. 

The idea of pseudodifferential operators, then, is to consider operators 
of the form (8.1) where p is a more general sort of function. Actually, if one 
allows general functions or distributions p in (8.1), one obtains an enormous 
family of operators that is much too diverse to support an interesting the- 
ory. (For example, any continuous linear map from S(IR”) to 8‘(IR") can be 
represented in the form (8.1) where p is a tempered distribution on R" x R® 
and the integral is interpreted in the sense of distributions. See Exercise 1 
in §8B.) We shall restrict attention to functions p(z,€) that behave quali- 
tatively like polynomials or homogeneous functions of € as + oo — that 
is, they grow or decay like powers of ||, and differentiation in € lowers the 
order of growth. 

To be precise, suppose 2 is an open set in R” and m is a real number. 
(Note: In this chapter, in contrast to the preceding one, we do not assume 
that 2 is bounded.) The set of symbols of order m on 2, denoted by 
S™(Q), is the space of functions p € C™(Q x IR”) such that for all multi- 
indices a and @ and every compact set K C 2 there is a constant Coax 
such that 


(52) sup |DEDEp(z,£)| < Co,a,«(1 + lel!" 


A pseudodifferential operator (or ¥DO for short) of order m on 2 is 
a linear map from C2°(Q) to C™(Q) of the form (8.1) where p is a symbol 
of order m on 2. We shall generally denote the map in (8.1) by p(z, D), 


(8.3) p(2, D)u(2) = ; ek n(x, €)a() dé, 
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and we denote the set of pseudodifferential operators of order m on 2 by 
w™(Q): 
W™(Q) = {p(z, D): pe S™(Q)}. 

Warning: Our placement of the factor of 27 in (8.3) flies in the face of 
convention. It is much more common in the literature on pseudodifferential 
operators to define the Fourier transform without the 27 in the exponent, 
and accordingly to modify (8.3) by omitting the 27 from the exponent and 
inserting a factor of (27)~" in front of the integral. In particular, this 
entails defining D as (1/i)0 rather than (1/27%)d. 


Example 1. 
If p(z,€) = Dace Ga(z)E%, then p € S*(Q). In other words, every differ- 
ential operator with C@™ coefficients on 2 is a YDO on 2. 


Example 2. 

Let us say that a function p € C®(Q x R") is homogeneous of degree 
m for large & (m € R) if there exists c > 0 such that p(z, t£) = t™p(z, £) 
whenever ¢ > 1 and |€| > c, or in other words, if p agrees with a func- 
tion that is positively homogeneous of degree m in € on the set |£| > c. 
(The utility of this notion stems from the fact that homogeneous functions 
are never C™ at the origin unless they are actually polynomials.) If p 
is homogeneous of degree m for large , then Df Dé'p is homogeneous of 
degree m — {a| for large &, and it follows that p € S™(Q). More gener- 
ally, if p= oy p; where p; is homogeneous of degree m; for large €, then 
p € S™(Q) where m = max{m;}. 


Example 3. 

The function (z,€) — (1+ |€|?)*/? belongs to S*(IR") (Exercise 1), and 
hence the operator A* defined by (6.4), which plays an essential role in the 
theory of Sobolev spaces, belongs to ¥*(R"). 


The preceding examples of symbols are functions that are homogeneous 
in € or asymptotically homogeneous, in a suitable sense, as £ —> co. The 
symbol classes S™(Q) also contain other sorts of functions, however; see 
Exercises 2 and 3. 

If p € S™(Q) and q € S™(Q), it is clear that p+q € Smex(™m2)(Q), 
and a simple calculation with the product rule shows that pg € S™+™2(Q). 
Moreover, if m, < mz, we clearly have S™(Q) C S™?(Q), so it is natural 
to set 

Sore Lis), se Ojya (sa). 


meR meR 
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S°(Q), the set of symbols of arbitrary order on &, is thus a filtered algebra. 
Likewise, we set 


wr(av= LJ wa), wr (Q) = (} W™(Q). 


meR meR 


¥(M), the set of pseudodifferential operators on Q, is also a filtered al- 
gebra, but the fact that if P € W™'(Q) and Q € Y™(Q) then PQ E€ 
wm:+™2(Q) requires some proof that we shall not give until §8D. 

Let us discuss the domain and range of pseudodifferential operators a 
little more fully. If p € S™(Q), the integral (8.3) converges for x € 2 
whenever u € S. However, it is more suitable to think of the domain of 
p(x, D) as consisting of functions living on 2, so to begin with we take the 
domain to be C2°(Q). If u € Co°(Q), the differentiated integrals 


D*[p(z, D)u(z)] = / D2[e?**€ p(x, €)}a(€) dé 


converge absolutely and uniformly on compact subsets of 2 since u € 5 
and the derivatives of p(z,é) and e?*'*€ have polynomial growth in ¢. 
It follows easily that p(z,D)u € C™(Q), and moreover that p(z,D) is a 
continuous linear map from C2°(Q) to C™(Q). That is, if uy, ue,... are 
elements of C'°(Q) that are supported in a common compact subset of 2 
and D®u; — Du uniformly for all a, then D*[p(z, D)u;] + D*[p(x, D)u] 
uniformly on compact subsets of 2 for all a. 

We would like to relax the requirements that u be smooth and com- 
pactly supported. To deal fully with the issue of compact support requires 
a restriction on the symbol p that we shall discuss in §8B, but the smooth- 
ness condition can easily be dispensed with, as there is a natural way of 
defining p(z, D)u as a distribution on 2 whenever u is a distribution with 
compact support on 2. Indeed, if u € Co°(Q), for any ¢ € C2°(Q) we have 


(ole, Dyw, 6) = ff Pr &p(2, 4) 4(@) dg dz = f 99(€)a(E) ae, 
where 
(8.4) ao(6) = f p(x, oe) de. 
(8.5) Lemma. 


If p € S™(Q) and ¢ € Co°(Q), the function gg defined by (8.4) decays at 
infinity more rapidly than any power of €. 
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Proof: For any £,7 € R” we have 
nt [ etep(2,6)6(2) de = f Dee" ")p(z, 8) (2) de 
= (-1)t f Pra Detple,e)ole)) de. 
Since the derivatives of p(z,€) are all dominated by (14+-|€|)” for x € supp ¢, 


< Ca(1 + |é1)™ 


ne i ep (2, €)4(2) dz 


for all a, so 


(1+ Inf)” 


fi e102, €)(2) dz 


< Cn(1 + lél)” 
for all N. Setting n = €, we obtain the desired result: 
Ig(€)1 < Cw (1 + le. ' 


Now, if u € €/(Q), or more generally if u € H, for some s € R (cf. 
Corollary (6.8)), then @ is a function that is square-integrable with respect 
to (1 + |é|?)~* dé for some s € R; hence Lemma (8.5) shows that the 
integral [ gg% is convergent. Moreover, the proof of Lemma (8.5) shows 
that if ¢; + ¢ in CO°(Q) then (1+|€|)%¥ 9g, (€) > (14+ 1él)% gg(€) uniformly 
on R® for all €, so that f 94, — f ggit; that is, ¢ + f gg is a continuous 
linear functional on C2°(2). In short, we can define p(x, D)u € D’(Q) for 
any u € €'(Q) by 


(ole, Dyn, 6) = (96,2), sal = i e245 f(x, €)6(z) dz. 


It follows easily from this construction that if u; — u weakly in €’(Q) then 
p(z, D)uj — p(z, D)u weakly in D’(Q). 

We conclude this section by remarking that the theory of pseudodiffer- 
ential operators has been extended to include classes of symbols and op- 
erators that are wider than S°(Q) and ¥°(Q). The operators in ¥° (2) 
have earned the name of classical pseudodifferential operators, they are 
the operators that have come to be a standard tool for studying all sorts 
of differential equations, and they are the only ones we shall consider in 
this book. However, more general symbol classes have been found useful 
for various problems. The ones the reader is most likely to encounter are 
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Hormander’s classes S™,(Q) (m € R,0 < 6 < p < 1), which are defined 
just like S"({) except that condition (8.2) is replaced by 


det {Dé Dé p(z, é)| < Cap,K(1 + je|)™— Platt sll 
Zz 


(Thus, S™(Q) = Sf(Q).) Most of the theory that we shall develop for 
S™() generalizes to S7,() in a fairly straighforward way provided that 
6 < p. The theory for p = 6 < 1 is more subtle but also richer in interesting 
applications. (The cases 6 > p and 6 = p = 1 are pathological.) The reader 
may find an account of the theory of ¥DO with symbols in S7*,(() in Taylor 
[48] or Treves [53, vol. I], and extensions of the theory to much more general 
symbol classes in Beals [5] and Hormander (27, vol. IIJ]. 


EXERCISES 
1. Let p(z,€) = (1+ |é|?)*/? (s € R). Show that p € S*(R"). 


2. Pick ¢ € C?°(IR") with @ = 1 on a neighborhood of the origin, and let 
p(z,€) = [1 — 4(€)] sin log |é|. Show that p € S°(R*). 


3. Suppose p € S°(Q) and y is a C© function on a neighborhood of the 
closure of the range of p (a subset of C). Show that pope S°(Q). 


4. Show that if p € S~°(Q) then p(z, D) maps €'(2) into C™(Q). 


B. Kernels of Pseudodifferential Operators 


Suppose T is an integral operator on some space of functions on 2 that 


includes C2°(Q): 
Tu(2)= [ K(vuy)dy (we CP). 
If v € C°(Q), we have 
[ Tateyo(e) az = Ths K(z, y)v(x)u(y) dy dz, 


or in the language of distributions, 


(Tu, v) = (K,v® u), 
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where v @u € C°(D x Q) is defined by 
(v @u)(z,y) = v(z)u(y). 


There is now an obvious generalization: Suppose T is a continuous linear 
map from C%(Q) to D'(Q). If there is a distribution K on Q x 2 such that 


(Tu, v) = (K,v@u) (u,v € C2(Q)), 


then K is called the distribution kernel of T. K is uniquely determined 
by T because the linear span of the functions of the form v @ u is dense 
in C2°(Q). For example, the distribution kernel of the identity operator is 
the delta-function 6(z — y). 

In fact, every continuous linear T : C9°(Q) — D’(Q) has a distribution 
kernel. This is one version of the Schwartz kernel theorem, for the proof 
of which we refer to Treves (49] or Hormander [27, vol. I]. Another one 
is that if T is a continuous linear map from S(IR") to 8/(IR"), there is a 
K € 8'(R" x R”) such that (Tu, v) = (K,v@u) for all u,v € S(R"). These 
facts provide a helpful background for the discussion of distribution kernels, 
but we shall have no specific need for them. 

It is easy to compute the distribution kernel of a pseudodifferential 
operator. Indeed, if p € S™(Q), 


(ole, Dy) = ff p(x, €) (gol) de de 
= | [fee n2,8)0@ ula) dy de, 


from which it follows that the kernel KC of p(x, D) is 
(8.6) K(z,y)=pr(z,e-y) (2, y EQ), 


where py denotes the inverse Fourier transform of p in its second variable. 
Of course, (8.6) is to be interpreted in the sense of distributions. That is, 
p(z,-) is a tempered distribution depending smoothly on z, so the same 
is true of py(z,-). In particular, py defines a distribution on 2 x R"; 
composing it with the self-inverse linear transformation (z, y) — (x, z— y) 
yields another such distibution, and K is the restriction of the latter to 
Nx 2. 

The distribution K turns out to be nicer than one might initially expect: 
It is a C™ function off the diagonal 


(8.7) An= {(z,y)EQxQ:2=y}, 


and its singularities along the diagonal can be killed by multiplying it by 
suitable powers of z — y. More precisely: 
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(8.8) Theorem. 

Suppose p € S™(Q) and K is the distribution on Q x Q defined by (8.6). 

a. If |a| > m+n+j, the function fa(z,z) = zp (a, z) is of class C) on 
Q x R®. Moreover, f, and its derivatives of order < j are bounded on 
Ax R® for any compact AC 2. 

b. If lal > m+n+ J, (2 — y)* K(z,y) and its derivatives of order < j are 
continuous functions on Q x Q. In particular, K is a C® function on 
(Q x 2)\ Ag. 


Proof: z“p¥(z, z) is the inverse Fourier transform (in the second vari- 
able) of (-1)!*!D¢p(z, €). For z in any compact set A, the latter function is 
dominated by (1+[é])”7~!*l; in particular, if |a| > m+n, it is integrable as 
a function of , so the Fourier transform can be interpreted in the classical 
sense: 


fales2) = (—1N" f ePrt*€Dzp(e,8)d€ (lal > m+n). 


Thus fa is a bounded continuous function on A x R". Moreover, if the 
integrand is differentiated no more than j times with respect to z or z, it 
is still dominated by (1+ |é|)!¢l-™+ for z € A, so if ja] > m+n+j one 
can differentiate under the integral and conclude that the derivatives of f,. 
of order < j are bounded and continuous on A x R". This proves (a), and 
(b) then follows by virtue of (8.6). | 


From this result we can deduce an important regularity property of 
pseudodifferential operators. Some terminology: The singular support 
of a distribution u € D’(Q) is the complement (in 22) of the largest open set 
on which u is a C™ function; it is denoted by sing suppu. A linear map 
T : E'(Q) > D'(Q) is called pseudolocal if sing supp Tu C sing supp u 
for all u € E/(Q). (The motivation for this name is as follows: A linear 
operator T on functions is called local if Tu, = Tuz on any open set where 
uj = ug; this is equivalent to the requirement that supp Tu C supp u for 
all u. Pseudolocality is the analogue with support replaced by singular 
support.) For example, all differential operators with C™ coefficients are 
pseudolocal (and also local); hypoellipticity of a differential operator L 
means precisely that the reverse inclusion sing supp u C sing supp Tu also 
holds. 


(8.9) Theorem. 
Every pseudodifferential operator is pseudolocal. 
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Proof: Suppose P € W™(Q) and u € €’(Q). Given a neighborhood 
V of sing supp u in Q, choose ¢ € C>°(V) such that ¢ = 1 on sing supp u, 
and set ui = du and ug = (1—¢)u. Then u = uy + we, suppu, C V, 
and uz € Co°(Q). If K is the distribution kernel of P, by Theorem (8.8) 
K(z,y) is a C® function for z ¢ V and y € V. Hence for z ¢ V we can 
write 

Pus(t)=(K(2,),m) =f  K(e,v)ua(v)ady, 
SUPP ty 

from which it is clear that D? Pu,(z) = (D2 K(z,-), u:) and hence that Pu, 
is C™ outside of V. On the other hand, Puz € C™(Q) since uz € C&?(Q). 
Thus Pu is C® outside V, and since V is an arbitrary neighborhood of 
sing supp u, Pu is C™ outside sing supp u. i 


We shall call a linear map T : €/(Q) — D/(Q) a smoothing operator 
if the range of T is actually in C™(Q), that is, if sing supp Tu = @ for 
all u € €/(Q). As another easy consequence of Theorem (8.8), we have the 
following. 


(8.10) Proposition. 
Every P € ¥~™(Q) is a smoothing operator. 


Proof: By Theorem (8.8), the distribution kernel K of P is C@ on 
Q x Q, so as in the proof of Theorem (8.9) we see that Pu(z) = (K(z,-),u) 
is a C™ function for every u € E’(Q). ] 


Not every smoothing operator is pseudodifferential, however. For exam- 
ple, if p(z,é) = o(z)¥(€) where ¢ € C™(IR") and y € E’(IR”) then the op- 
erator p(z, D) defined by (8.3) is smoothing (in fact, p(z, D)u = ¢(u* yp), 
which is C® since ~Y is C®), but p does not belong to our symbol classes 
unless y is C™. 

We next address a point glossed over earlier, namely, the injectivity of 
the symbol-operator correspondence. Suppose p € S™(Q). If we regard 
the domain of the operator P = p(z,D) as S(IR") then p is completely 
determined by P, for in the integrals 


p(2, D)u(2) = / PME n(n, £)A(E) de 


we can choose u so that & approximates the point mass at an arbitrary 
point of R". However, if we restrict P to C2°(Q), as we have agreed to do, 
P does not completely determine p unless 2 is very large. The situation is 
explained in the following proposition. 
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(8.11) Proposition. 

Suppose p € S™(Q) and p(z, D) = 0 as a map from C&°(Q) to C™(Q). 

a. If the sete QD-Q = {zr —-y:z,y € Q} is dense in R" then p is necessarily 
zero, but otherwise p may be nonzero. 

b. In any case, p € S~™(Q). 


Proof: p is related to the distribution kernel K of p(z, D) by (8.6): 
K(2,y) = p¥(z, 2 —y) for z,y € Q, so if p(z, D) = 0 then py(z,z) = 0 
for z € Qand z € N-Q. Since Q — Q is an open set containing the 
origin, Theorem (8.82) shows that p}(2,-) is a Schwartz class function 
depending smoothly on z, so the same is true of p(z,-). This means that 
p € S~(Q), so (b) is proved. Moreover, if Q — Q is dense in R"”, the 
restriction of py to Q x (Q —Q) determines py, and hence p, completely. 
On the other hand, if 2 — Q is not dense, we can take p(z,é) = v(x) $(E), 
where y € CO°(Q), @ € CO°(R"), and supp ¢ is disjoint from N — Q; then 
K(z,y) = ¥(z)é(2 — y) = 0 on Q x Q and hence p(x, D) = 0. i 


One drawback to pseudodifferential operators is that their domains con- 
sist of compactly supported functions or distributions while their ranges do 
not, so in general they cannot be composed with one another. It is therefore 
often convenient to impose an additional condition on them to overcome 
this difficulty. The full solution to this problem will be achieved in §8D; 
the discussion that follows is meant to lay the groundwork for it. 

Suppose 2 is an open subset of R", and denote by wz and ay the 
projection maps from Q x Q onto the first and second factors, respectively. 
A subset W of Q x Q is called proper if m71(A)MW and m71(A) NW are 
compact whenever A is a compact subset of 2. For example, the diagonal 
An (see (8.7)) is proper; most of the proper sets we shall consider will be 
neighborhoods of Ag. (See Exercise 2.) 

Next, suppose T is a linear map from Co°(2Q) to C(Q) with distribu- 
tion kernel K. T will be called properly supported if supp K is a proper 
subset of Qx 2. 


(8.12) Proposition. 
T : C2(Q) + C(Q) is properly supported if and only if the following two 
conditions hold: 
i. For every compact A C Q there exists a compact B C Q such that 
supp Tu C B whenever suppu C A. 
ii. For every compact A C Q there exists a compact C C Q such that 
Tu = 0 on A whenever u = 0 on C. 
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Proof: If K is the distribution kernel of T and supp K is proper, (i) 
and (ii) hold if we take 


B= 1,[171(A) Nsupp K], C = my [xZ1(A) supp K]. 


This follows easily from the formula T'u(x) = f K(z, y)u(y) dy; see Figure 
8.1. For the same reason, if (i) and (ii) hold and A C 2 is compact, we 
have 771(A)Msupp K C B x A and 7z1(A)M supp K C Ax C. We leave 
the details to the reader as Exercise 3. i] 


Figure 8.1. Schematic representation of Properties (i) and (ii) in 
Proposition (8.12). The square represents 2 x Q and the shaded region 
represents supp Kx. 


Property (i) clearly implies that T maps C9°(Q) into itself, and property 
(ii) implies that T has a continuous extension to a linear map from C™(Q) 
to itself. Indeed, if u € C™(Q), we define Tu on an arbitrary compact 
subset A of 2 as follows. Let C be as in (ii), pick ¢ € CO°(Q) with @ = 1 
on C, and set Tu(z) = T[du](x) for c € A. This is independent of the 
choice of ¢, for if ¢’ is another such function we have du — ¢/u = 0 on 
C and hence T[¢u] — T[¢’u] = 0 on A. It follows that the definitions of 
Tu on two different A’s agree on their intersection (take a ¢ that equals 1 
on both C’s). This procedure defines Tu as a C™ function on Q, and the 
continuity of T on C'™(Q) follows from its continuity on C°(Q). 


(8.13) Corollary. 

If T and S are properly supported, one can form their composition ST as 
an operator on either CS°(Q) or C%°(Q). Moreover, ST is also properly 
supported. 
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Proof: It is easily verified that properties (i) and (ii) of Proposition 
(8.12) are preserved under composition. i] 


Every differential operator is properly supported. In fact, the distribu- 
tion kernel of )> aa(z)D® is }> aa(z)D%5(2 —y), whose support is a subset 
of the diagonal Ag. In §8D we shall show that every pseudodifferential op- 
erator can be modified in a rather harmless way so as to become properly 
supported, and that the properly supported pseudodifferential operators 
form an algebra under composition. 

For the moment, we observe that if T is a properly supported WDO, 
then T maps €’(2) into itself and extends to a continuous map of ‘D/(Q) 
into itself. The first assertion holds because any u € €’(Q) is the limit 
of a sequence of functions u, € C9°(Q) supported in a common compact 
neighborhood of supp u (convolve u with a suitable approximate identity); 
by Property (i), the T'u, are supported in a common compact set B and 
hence supp T'u C B. A similar approximation argument that we leave to 
the reader (Exercise 4) shows that the procedure we used to define T on 
C™(Q) can be extended to define T on D/(Q). 

We conclude with a couple of technical results on cutoff functions re- 
lated to proper sets which will be needed in later sections. 


(8.14) Proposition. 
If W is a proper subset of Q x Q, there exists ¢ € C~(Q x Q) such that 
supp ¢ is proper and ¢ = 1 on a neighborhood of W. 


Proof: We begin with the following fact whose proof we leave to the 
reader (Exercise 5): There exist sequences {U;}{° and {V;}?° of open sets 
such that UP U; = 2, U; C Vj, V; is compact, and each point in 2 has a 
neighborhood that intersects only finitely many V;. Let 


J= {(3,k): WA(U; x Ur) $B}. 


We claim that for each j there are only finitely many k such that (j,&) € J. 
If not, there exists a j and a point (zz, yx) € WA(U; x Ux) for all k in some 
infinite set K. Since 771(U;) MW is compact, the sequence {(zz, yk) }eex 
must have a cluster point. But this is impossible, for every y € Q has a 
neighborhood that meets only finitely many U; and hence contains at most 
finitely many yx. 

Next, let 
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We claim that X is proper. It suffices to show that 77!(Uj;)NX is compact 
for all j, since every compact set in 2 is contained in a finite union of U;’s. 
(The same argument will also work with a; replaced by wy.) But 


me (Uj) X = 4Z (Uj) A J{V; x Vi V5 Nj, # B, (5,k) EI}. 


The union on the right is finite, for there are only finitely many V; that 
meet the compact set U;,, and for each such 7 there are only finitely many 
k with (j,k) € J. Since the sets V; x V; are compact and x7!(U;,) is 
closed, we are done. 

Finally we can construct the desired function ¢. For each j, pick a 
nonnegative ¥; € C°°(Q) such that ; = 1 on U; and supp y; C Vj, and 
let (z,y) = Lue pj (2)vi(y). Each (z, y) has a neighborhood on which 
only finitely many terms of this sum are nonzero, so » € C™(Qx 2). Also, 
supp ¥ is proper since it is contained in X, and # > lon Uy; gye3(Uj x Ue), 
which is a neighborhood of W. Hence we may take ¢ = ¢ o w where 
¢:R-+ (0,1) is a C™@ function such that ¢(t) = 0 for t < 0 and C(t) = 1 
fort > 1. 1 


(8.15) Proposition. 

If N is a neighborhood of the diagonal Ag, there exists ¢ € C@(Q x Q) 
such that: 

a. supp ¢ Is proper and contained in N, 

b. 6 =1 on a neighborhood of Ag. 


Proof: Let U; and Vj; be as in the preceding proof. Each z € 2 
has neighborhoods O} and O? such that Of ¢ O? and O2 x 02 CN. 
Each Vj; can be covered by finitely many O}’s, say V; = UV; NO},. Let 
Ujx = Uj NO}, and Vjz = VjNOZ,; then U, , Ujx x Uje is a neighborhood 
of Ag and Uj Vix x Vix CN. We therefore obtain ¢ as in the preceding 
proof by modifying ¥(z,y) = Dik wje(z)vje(y), where now pj, = 1 on 
Uje and supp pjx% C Vie. i 


EXERCISES 


1. Use the Schwartz kernel theorem to show that every continuous linear 
map T from S(IR") to $/(IR") can be represented in the form (8.3) where 
p € S‘(IR" x R") and the integral is interpreted in the distributional 
sense. (Hint: Use (8.6) to define p.) 
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2. Show that {(z,y): 2? < y < 21/7} is a proper subset of (0,1) x (0, 1). 
3. Complete the proof of Proposition (8.12). 


4. Show that every properly supported ¥DO on (Q extends to a continuous 
linear operator on D’(Q). 


5. Prove the topological lemmathat begins the proof of Proposition (8.14). 
(Hint: First construct a sequence {S;} of open subsets of 2 such that 
Sj is compact, S; C Sj41, and Q = UP? Sj. Set U; = S; \ Sj-1 and 
Vj = Sj41\5j-2.) 


C. Asymptotic Expansions of Symbols 


Much of the calculus of pseudodifferential operators consists of performing 
calculations with “highest order terms” and keeping careful track of lower 
order “error terms.” For this purpose, the following notion of asymptotic 
expansion is extremely useful. 

Suppose {m;}§° is a strictly decreasing sequence of real numbers such 
that limm; = —oo, and suppose pj; € S™({) for each 7. We say that 
the formal series )->° pj is an asymptotic expansion of the symbol p € 
S™0(Q) if : 

p—>_p; € S™*(2) for all k > 0, 
j<k 
in which case we write 
P~ >>. 

0 
We emphasize that the series )-p pj need not be convergent, and if it is, 
its sum need not be p. The “sum” }~>° p; is simply a convenient way of 
keeping track of the sequence {p;}§°. 

For most purposes in the theory of pseudodifferential operators, it is 
only asymptotic behavior that really counts. If two symbols p and g have 
the same asymptotic expansion )>p;, then 


pP-g= (r-D») ~ (:- dn) €s™(Q) 


for all k, and hence p—q € S~™(Q); on the other hand, if p— q € S~™(Q) 
then the series g+0+0+---is an asymptotic expansion of p. Accordingly, 
symbols of order —oo are usually regarded as negligible. Likewise, on the 
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level of operators, WYDO of order —co — or more generally, smoothing 
operators of any sort — are usually regarded as negligible. In this sense, 
the symbol-operator correspondence p — p(z, D) is essentially a bijection 
for any 2; see Proposition (8.11). 

It is an important fact that asymptotic series can be used to build 
symbols: 


(8.16) Theorem. 
Suppose p; € S™i(Q) for each j > 0, where m; strictly decreases to —oo. 
Then there exists p € S™°(Q) (unique modulo S~™(Q)) such that p ~ 


Do Pi- 


Proof: Let {9;}$ be an increasing sequence of open subsets of 2 
with compact closure whose union is 2, and choose ¢ € C™(IR”) such that 
#() = 1 for |€| > 1 and 4(€) = 0 for |€| < 3. 

Claim: There is a sequence {t;}§° of positive numbers converging to 
0 such that for each j > 0, 


| DE De [4(tj€)p;(z, €)]| < 277 (1 + lel? I2! 


(8.17) . 
for z € Q; and ja] + |B] <j. 


Taking this for granted for the moment, we define 


oO 
p(z,€) = > o(tj€)pj(2, €)- 

j=o 
Since t; — 0, for € in any bounded set there are only finitely many nonzero 
terms in this sum, and it follows that p € C™(Q x RN"). To see that 
p € S™°(Q), suppose B is a compact subset of 2; we wish to estimate 
DE DE p(z, €) for z € B. Choose k large enough so that B C Qy and 
la| + |6| < &, and write 


P(z,£) = D> H(ty€)pj(z,€) + >> $(ts€)pi(z, €). 
isk j>k 


Since pj € S™ and ¢(t;€) = 1 for |€| large, we clearly have 


yo Dea(senst®.8) < S2o\(1 4+ lep™ “Pl SCC + eye. 


jsk isk 


On the other hand, by (8.17) we have 


> De DEa(sers(2.8)| < So a-F(1 + eyelet < (1 + fee lal. 


j>k j>k 
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Thus p € S™°(Q). The same argument shows that J~7° ¢(t;€)p;(z.€) € 
S™(Q) for all k; since $(t;£)p;(z,&) = p;(z,€) for € large, it follows that 
P~ Yo Pi: 

It remains to prove the claim. First, let us observe that Di{o(té)] = 
tI [DIg](té) and that, for y # 0, [DJ ¢}(té) = 0 except when |€| is compa- 
rable to t-!. If t < 1, this implies that 1+ |é| is also comparable to ¢—?, 
and it follows that 


|Dz[4(t€)]] < Cy + [é) 7"! for all t < 1. 


The product rule and the fact that pj € S4(Q) now easily imply that for 
some C; > 0, 


| D8 De [o(té)p;(2,€)]| < C1 + [EN !2! for 2 €Q;, lal +14 <9, t <1. 


Of course the expression on the left is actually zero if || < (2¢)-!. It 
therefore suffices to pick t; small enough so that (a) t; — 0 and (b) if 
[é| > (2t;)~* then Cj(1 + [E|)™—™-* < 2-4. 1 


In the next section we shall need a technical result which shows that a 
series )_ p; is an asymptotic expansion of a symbol p if certain apparently 
weaker conditions hold. Before stating and proving it, we need a lemma 
from calculus: 


(8.18) Lemma. 
If f : R—R is twice differentiable and f, f’, f" are bounded, 


(sup Is)” < 4 (sup LF(#)1) (sup 1F"(O))- 


Proof: By considering g(t) = a7'f(./a/bt) where a = sup |f(t)| 
and b = sup |f"(¢)|, we are reduced to proving that if sup |g(t)| < 1 and 
sup |g(¢)| < 1 then sup |g’(t)| < 2. Suppose instead that |g’(to)| > 2. 
By the mean value theorem, |g/(t) — g'(to)| < 1 for |é — tol < 1 and 
hence |g/(t)| > 1 for | — to] < 1. By the mean value theorem again, 
Ig(to + 1) — g(to — 1)| > 2, which contradicts sup [g(é)| < 1. I 


Remark: The optimal constant in Lemma (8.18) is 2, not 4; see 
Schoenberg [43]. 
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(8.19) Lemma. 
Let V be a compact subset of R" and W a neighborhood of V. There is a 
constant C > 0 such that for all f € C?(W), 


(sup 1asste)1) < © ( sup is(2)1) (Xmp (98 102)). 


Proof: Pick ¢ € C2°(W) such that ¢ = I on a neighborhood of V. 
The result follows by applying Lemma (8.18) to the real and imaginary 
parts of ¢f, considered as functions of x; alone (Exercise 1). i] 


(8.20) Theorem. 
Suppose pj € S™i(Q) for j > 0, where m; decreases to —oo, and p € 
C°(QxR"). Then p € S™(Q) and p~ >> p; provided that the following 
conditions hold: 
i, There exists a sequence of real numbers py with limp, = —oo such 
that for every compact BC Q, 


p(z,€) — >. p;(2,€) 


i<k 


sup §Can(l + lel). 
reB 


ii. If w and @ are multi-indices, there is a real number (a, 8) such that 


for every compact B CQ, 


sup [D8 Dz p(x, €)| < Ca,o,a(1 + |é|)Mo?. 
ré€B 


Proof: By Theorem (8.16), there exists g € S™°(Q) such that q ~ 
So Pj, and it will suffice to show that p— q € S~%(Q). 

First, by condition (i), for any compact BC QN and k > 0, ifz € B we 
have 


Ip(z, €) oa q(x, &)| < 


p(e.6) - ovite.2)| + 
j<k 
< Caa((l + le) + (1+ le]. 


de) Dvil2.6) 


d<k 


Since py, —- —oo and m, ~— —00, 
(8.21) sup [(p— 9)(2,6)1 $ Ca.w(1 + le)-™ 


for all N. 
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We need to prove that estimates of this sort hold also for D8 De (p49). 
For the case |a|+([@] = 1, for each € we apply Lemma (8.19) to the function 
f(z,n) = (p—a)(z, +7), taking V = B x {0} and W = Bx {7: |n| < 1} 
where B! is a compact neighborhood of B in 2. By condition (ii) and the 
fact that q¢ € $™°(Q), for some » € R we have 


sup |DEDE(p—a)(z.€E+mM1< Ca(1+lél)* (lal +16] < 2. 
z€B’, |ni<i 


Lemma (8.19) together with (8.21) therefore gives 


sup |DE De (p— 9)(2,6)1< Caw lele-% — (lel +141 = 1) 


for all N, as desired. The proof is now completed by induction on |a] + [A]: 
assuming that estimates of the type (8.21) for D8 D?(p—q) with |a|+|A| = 
k—1, Lemma (8.19) together with condition (ii) yields estimates of the type 
(8.21) for D8 D?(p — q) with |a| + |A| = k. ! 


EXERCISES 
1. Complete the proof of Lemma (8.19). 


2. Adapt the proof of Theorem (8.16) to show that every formal power 
series )jajp0 Cat” (Ca € C) is the Taylor series of some C® function, 


D. Amplitudes, Adjoints, and Products 
Our definition of pseudodifferential operators is based on the usual prac- 


tice of writing partial differential operators as differentiations followed by 
multiplication by coefficients: 


Pu(z) = J) aa(2)D7u(z) = i ye aa(2)e rae) de. 


However, in some situations such as computing adjoints, it is preferable to 
reverse the order of these operations: 


Qu(z) = )> D*(aqu)(z) = if So c7** Se" (aM ul (6) dé: 
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Setting p(z,&) = )> aa(x)é* and writing out the Fourier transform explic- 
itly, these become 


Pu(z)= // ePti(2-9) € o(x, E)u(y) dy dé, 
Quix) = ff reply, gouty) dy dé. 


This suggests that we could also define pseudodifferential operators by 
using more general symbols p in the last formula for Q. 

Actually, for maximum flexibility it is preferable to allow coefficients on 
both sides of the differentiations. We are therefore led to consider operators 
of the form 


(8.22) Pau(z) = i i e*™(2-¥) €a(x, €, y)u(y) dy dé, 


where a belongs to the class 


A™(Q) = {a €C*(Q x R® x Q) : for every compact BC Q, 
sup |D2D; Deal, €,9)] $ Cayea(l + lel '*'}. 
ny 
The elements of the class A™(Q) are called amplitudes of order m on Q. 
A few remarks on these definitions are in order: 


i. The integral in (8.22) must always be interpreted as an iterated integral 
in the indicated order; it is usually not absolutely convergent as a double 
integral. 


ii. The operator P, defined by (8.22) is to be interpreted, as in the case 
of WDO, as a linear map from C9°(Q) to C(Q). 


ili. The class A™(Q) includes $™(Q) if we regard symbols as amplitudes 
that are independent of the third variable, and hence the corresponding 
class of operators {P, : a € A™(Q)} includes ¥™(Q). We shall shortly 
see that these two classes actually coincide modulo smoothing operators 
and that the properly supported operators in the two classes are the 
same. 


iv. In contrast to the symbol-operator correspondence p — p(z,D), the 
amplitude-operator correspondence a — P, is highly non-injective, a 
fact which is useful in some ways and awkward in others. For example, 
if d1, ¢2 € C™(Q) then the function a(z,&,y) = ¢1(z)d2(y) belongs to 
A°(Q), and Pau = ¢1¢2u. In particular, if supp ¢; Nsupp ¢2 = @ then 
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P, = 0, and if » € C™(Q), there are at least as many amplitudes for 
the operator u — wu as there are ways of factoring 7 as ¢1¢2. 

v. There is another way of defining pseudodifferential operators that deals 
with differentiation and multiplication by coefficients in a more sym- 
metric way, the so-called Wey] calculus; see Folland [15]. 


By an argument like the one in §8B, if a € A™(Q) the distribution 
kernel K of Py is given by 


(K,w) = If e2t2-¥) Eola £,y)w(z, y) dy dé dz (wECcor(Q x Q)). 


In other words, K is given by the inverse Fourier transform of a in its 
second variable: 


(8.23) K(2,y) = az(z, x—y, y), 


interpreted in the appropriate distributional sense. From this it is clear 
that 
supp K C Xa, 


where 


(8.24) Us={(x,y)E€Qx Q: (z,£,y) € suppa for some € € R"}. 


(8.25) Proposition. 
Ifa € A™(Q), there exists b € A™(Q) such that P, is properly supported 
and Pz — Py is a smoothing operator. 


Proof: By Proposition (8.14) we can find a properly supported ¢ € 
C®(Q) such that ¢ = 1 on a neighborhood of the diagonal An. Let 
b(z,é,y) = O(z,y)a(z,é,y). It is easily verified that b € A™(Q). If K 
is the distribution kernel of P,, it follows from (8.23) that the distribution 
kernel of P; is ¢K, which is properly supported since ¢ is. Moreover, the 
same argument that proves Theorem (8.8) shows that K is C® away from 
Ag and hence that (1 — ¢)K, the distribution kernel of P,; — Pj, is C™ 
everywhere. Hence P, — P; is a smoothing operator. | 


(8.26) Proposition. 
If a € A™(Q) and Py is properly supported, there exists b € A™(Q) such 
that: 
a. b(z,€,y) = a(z,&,y) for all € € R" and z,y in some neighborhood of 
the diagonal Ag; 
b. X»y (defined by (8.24)) is a proper subset of Q x Q; 
Cc, P, = Py. 
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Proof: Let K be the distribution kernel of P,. By Proposition (8.14) 
we can find a properly supported ¢ € C™(Q x ©) such that ¢ = 1 ona 
neighborhood of (supp K)U Ap. Set b(2,£,y) = ¢(2, y)a(z,é,y). As in the 
preceding proof, b € A™(Q) and the distribution kernel of P, is 6K = K, 
so P, = P,. Finally, £, is proper since it is contained in supp ¢. | 


Suppose that a € A™(Q) and P, is properly supported. By Proposition 
(8.12) and the discussion following it, P, extends to a continuous operator 
on C™(Q). In particular, we can apply P, to the function Eg(z) = e?#*"€ 
for any € € R*. In what follows, by a slight abuse of notation we shall 
denote the function P,(E¢)(z) by Pa(e?***°*). 


(8.27) Theorem. 
Suppose a € A™(Q) and P, is properly supported. Let 


(8.28) p(z,é) = en 2niz EP,(e2*i€), 


Then p € S™(Q) and P, = p(z, D). Moreover, 


(8.29) r(z,é)~ J 7 of Dgalz,€,W)lyae 


lal20 


Remark: The last statement means, more precisely, that 


co 


1 
P~ >oP; where p;(z,&) = s 98 Dy a(z,€, 9) oe 


2 lel=3 


p, clearly belongs to S™~4(Q). (Similar remarks will apply to the asymp- 
totic expansions in Theorems (8.36) and (8.37) below.) 


Proof: If u € C%°(Q), we write u(x) = f e?***€a(é)dé. As the 
reader may verify (Exercise 1), this Fourier integral is the limit of Rie- 
mann sums )> e?*** Ga(€,)Ag; that converge in the topology of C°(Q) 
(i.e., the sums and their derivatives converge uniformly on compact sets 
to u and its derivatives). Since P, is linear and continuous on C™(Q), we 
have 


Pau(2) = ‘) P,(e?*€)a(€) dé = jf eM Lp(n, El) de, 


so P, = p(t, D). We shall show that p € S™(Q) and that (8.29) holds by 
using Theorem (8.20). 


Pseudodifferential Operators 287 
By Proposition (8.26), we can modify a so that ©, is proper without 


affecting P, or the behavior of a along Ag (which is all that matters in 
(8.29)), and we henceforth assume that this has been done. We then have 


where the inner integral converges nicely since the function y — a(z,é, y) 
belongs to Co°() for each x and €. If we set 


b(z,é,y) = a(x, €, 2 +y) 
this becomes 
p(z,n) = [poe £b(x, Ey — xe)" dn dé 
. [fom (€-" b(n, &, 2) dedé 
s [tate &&-n) dé, 
or 


(8.30) p(2,n) = / Bs(x, +n, €) dé, 


where bs denotes the Fourier transform of 6 in its third variable. 
Since a € A™(Q) and b(z,n,-) € CO° for each x and n, for z in any 
fixed compact subset of 2 we have 
(8.31) [DE Dy bs(z, 7,8) < Ca,en(1 + Il) + 11). 
In particular, 


[DE D%b3(x, € +, E) < Ca,p,n(1 + [E+ al) HAN + Jem. 


The triangle inequality implies that 


1+ [€+n] < (14+ lé)Gtial), 


and raising both sides to the (positive!) power |m| + |a] shows that 


[D8 D%ba(x, € +7, €)| < Ca,o,n(1 + lal) tela + jepinttel-N, 
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If we apply this estimate to (8.30) with N = |m|+|al+n-+1, we see that 
|DE Ds p(z,n)| < Ca,o(1 + [nl)imtlet, 
which is condition (ii) in Theorem (8.20). 


To establish condition (i), we apply Taylor’s theorem to the function 
¢ + a(x, 0 +6,), set ¢ = é, and apply (8.31): 
ta(z,€+n€-— D> azha(e.ne) S| 
lal<k : 


< cE sup \él*|a%b3(z, n+ té, €)| 
laj=k, O<t<1 


Seen sup |€(*(1+ [p+ té|)™-*(1 + [e7%. 
O<t<i 


(These estimates, like the preceding ones, hold for z in a fixed compact 
subset of ©.) If |€| < $]n|, we take N = k and obtain 


ise, E+ — D> as(2, 0.68 


faj<k 
If |€| > $|n|, we take N = 2k + n + max(m — k,0) and obtain 


Bale, €+n, 8) — So aBa(z. nes 


la|<k 


< h(i + Inl)™-*. 


< of(1 + lel)? 


Substituting these results into (8.30) and integrating in polar coordinates, 
we see that 


p(z.n)— >) / ag bs(2,7,8) 5 ae 


lalse 


ee i (14+ [nly ™7# dé + ef | (1+ [ert de 
lél<int/2 fél>In]/2 


< Ce [1+ [n™t"-* + (1+ bnl)*]. 


But 
J esiiale.n. ee" ae = [25 / ev £9%b3(2, 0, €) ag| 
y=0 
= Di95 (2,9; ¥)|ya0 
= Dyd;a(z,7, Wlyaci 
so 


1 na 
p(2,n)— > Dy Aralz.m Wlyee| S 2Ce(l + bal), 
Jal<k 
where py = max(m +n, 0) — k. Condition (i) of Theorem (8.20) is thus 


fulfilled, and we are done. | 
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(8.32) Corollary. 

Ifa € A™(Q), there is a properly supported Q € ¥™(Q) such that P; —Q 
is a smoothing operator. If a € S™(Q) (so that P, = a(x, D)), there is a 
properly supported Q € ¥™(Q) such that a(z,D)—Q € ¥~™(Q). 


Proof: By Proposition (8.25) there exists 6 € A™() such that P, 
is properly supported and Pa — Ps is smoothing, and by Theorem (8.27), 
P, € ¥™(Q); this proves the first assertion. Moreover, the 6 in Proposition 
(8.25) is constructed so that the distribution kernel of P, — P;, vanishes on 
a neighborhood of the diagonal. Thus if a € S”(Q) and q is the symbol of 
Py given by (8.28), by (8.6) (a — g)¥(x,z) vanishes near z = 0 for each z. 
By Theorem (8.8a), (a — q)¥(z,-) is a Schwartz class function for each z, 
with estimates depending uniformly on z for z in a compact set, and hence 
the same is true of (a — q)(z,-). Thus a— q € ¥~™(QN); in other words, 
Pa — Py € W-™(O). i] 


As we observed in §8B, whether P € ¥™(Q) has a unique symbol or not 
depends on the nature of 2. However, if P is properly supported, (8.28) 
gives a canonical choice of symbol for P, and we shall denote it by op: 


(8.33) op(z,é) = en 2riat pi eerie €). 


We can now obtain a perfect correspondence between symbols and opera- 
tors if we follow the philosophy introduced in §8C of regarding symbols and 
operators of order —oo as negligible, i.e., of considering equivalence classes 
of symbols and operators modulo symbols and operators of order —oo. 


(8.34) Corollary. 

The map p -—+ p(z,D) induces a bijection from S™(Q)/S~°(Q) to 
W(Q)/¥--(Q), and the map P -— op induces a bijection from 
W(Q)/W-M(Q) to SP(N)/S~9(Q). These bijections are mutually 


Inverse. 


Proof: Each equivalence class in ¥(2)/W~°(Q) contains a properly 
supported operator, by Corollary (8.32). The assertions therefore follow 
from Proposition (8.11). ] 


We are now in a good position to compute transposes, adjoints and 
products of pseudodifferential operators. The terminology we employ is 
as follows: If T and S$ are linear maps from C2°(Q) to C™(), we say 
that S is the transpose of T and write $ = T’ if (Tu,v) = (u, Sv) for 
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all u,v € CS°(Q); we say that S is the adjoint of T and write S = 
T* if (Tu,d) = (u,Sv) for all u,v € C2(Q). If T has the distribution 
kernel K(z,y), then T’ and T* have the distribution kernels K'(z,y) = 
K(y,2) and K*(z,y) = K(y,z) respectively. It follows that if T is properly 
supported, then so are J’ and T*. 


(8.35) Lemma. 
Suppose p € S™(Q). Then p(z,D)' = Py where a{z,£,y) = p(y, —€), and 
p(x, D)* = Py where b(x,€,y) = p(y, €). 


Proof: We consider only p(z, D)’; the argument for p(x, D)* is essen- 
tially identical. Suppose u, v € C&(Q). Since 


wW(e, Dyu, v) = [ff ane p(x, e)u(e)u(u) dy dé de, 
the proof is just a matter of reversing the order of integration. This is not 
a routine application of Fubini’s theorem, however, since the triple integral 


is (usually) not absolutely convergent. Instead, we apply Fubini’s theorem 
to the absolutely convergent double integral 


(w(t, Dyu, 0) = ff 2" € p(x, g) A )u(e) de dz 


to obtain 
(ole, Du, v) = f (ACE) a8, 


where 


g(€) = fo"? *p(e,€u(2) dz. 


g is a rapidly decreasing function by Lemma (8.5), so we have 
(p(z, D)u, v) = (t, 9} = (u,9). 
Therefore 
p(2, Dy v(u) = 9y) = ff Pe * p(x, 8)0(2) de dg 
= i if e*-#) n(x, —E)u(x) dx dé, 


so that p(z, D)! = P, with a(z,£,y) = p(y, —€) as claimed. 1 
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(8.36) Theorem. 
If P € ¥™(Q) is properly supported, then P’, P* € ¥™(Q), and 


~ p)lel 
op(2,é)~ 5 ap veorlz,-8), 


la|20 
1 —— 
ope(2,)~ )) 8¢ De or(z,€). 
jal>0 


Proof: This is an immediate consequence of Theorem (8.27), Corol- 
lary (8.34), and Lemma (8.35). i] 


(8.37) Theorem. 


If P € W™(Q) and QE w™ (Q) are properly supported, then QP € 
wt’ (Q) and 


cap(2,6)~ D> oF ealz,€)- DEoplz, 6). 
Jalz0 


Proof: Since P = (P’)’, by Lemma (8.35) we have 


Pu(2) = i i) e2Fi(-¥) Eg pi (y, u(y) dy dé, 


that is, 
Pulé) = , e-*V Ep (y, ~€)uly) dy. 


Therefore, 
QPu(z)= | f 2» aa(2, amy, —eu(y) dy a, 
which means that 


QP=Pq where a(z,€,y) = oQ(z,€)or(y, —€). 


Clearly a € Amt (9): and QP is properly supported by Corollary (8.13). 
Hence, by Theorem (8.27), QP € ¥™*+'™'(Q) and 


sar(z,€)~ S> GOP DSloa(2,8)oP(¥, Oyen 


lal20 
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Now, by the product rule, 


d2DF[oq(z.€)or(y,-Oyo2= Do By o [eale, €)]-6f De [op-(z, —€)], 
B+y=a 
80 


1 
sap(z,é)~), ide 792.8) - 87 DE*" fo pi(z, -€)], 
o Biy 


with the understanding that, for each j > 0, all the terms with [6|4|y| =j 
(of which there are a finite number) are to be grouped together to form 
a single term p;(x,€) € sm+m'-j in the asymptotic sum. With the same 
understanding, Theorem (8.36) then implies that 


can(ad)~ 3 oo a aag(z,8)- 8 DE+++ op (2, €) 
By 


14] 
IE CI jit eal) 22D ena 
BA 


+5=r 


But by the binomial theorem, if x = (1,1,---,1) ER", 


—1)!4 1 ifA=0, 
x CP =e o= {5 if \ #0. 


+62 Y 
Therefore, ‘ 
ogp(z,€) om Ss oe 79(™ 8) * D8 op(x,é), 
ra 
and we are done. ] 


S(Q) is a *-algebra (i.e., algebra with involution) under pointwise 
multiplication and the involution p — p. S~™(Q) is a +-ideal, so the 
quotient S°(2)/S~™(Q) inherits the structure of a *-algebra. On the 
other hand, by Theorems (8.36) and (8.37) the set of properly supported 
operators in W°(Q) is a *-algebra under operator composition and the 
involution P —+ P*., Again, the operators of order —oo form a +-ideal, 
so in view of Corollary (8.32), ¥°(Q}/W~~(Q) inherits the structure of 
a *-algebra. The correspondence p — p(z,D) and its inverse P — op 
(as in Corollary (8.34)) are not homomorphisms — after all, the symbol 
algebra is commutative whereas the operator algebra is not — but they 
become homomorphisms if we neglect all but the highest order terms in 
the asymptotic expansions of Theorems (8.36) and (8.37). Indeed, a glance 
at these expansions immediately yields the following result. 
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(8.38) Corollary. 

The byections of Corollary (8.34) are *-algebra homomorphisms modulo 
lower order terms. More precisely: 

a. If P € W™(Q) and Q € Y™'(Q) are properly supported then then 


opg=opoeq (mod gm+m'-1(9)), 


op» =@p (mod S™1(Q)). 


b. If p € S™(Q) and g € S™'(Q) and p(z, D) and q(z, D) are properly 
supported then 


p(z, D)q(z, D) = (pq)(z,D) (mod ¥™+™'-1(Q)), 
p(x, D)* = p(x, D) (mod ¥™-1(Q)). 


There is yet another algebra structure that is preserved by the symbol- 
operator correspondence up to lower order terms. Namely, for C™® func- 
tions on 2 x R” one has the Poisson bracket 


- Op Og _ oq Op 
fra) = > (z Ox; 06; =) , 


which makes S®(Q) and S®°(Q)/S~%°(Q) into Lie algebras. (More pre- 
cisely, if p € S™(Q) and q € S™(Q) then {p,q} € S™+™’-!(Q).) On the 
other hand, by Theorem (8.37) and Corollary (8.32), ¥~(Q)/W~™(Q) in- 
herits a Lie algebra structure from the commutator of operators, [P,Q] = 
PQ— QP. If we subtract the asymptotic expansion for the symbol of QP 
from that of PQ, we see that the highest order terms cancel and that the 
first remaining terms give the Poisson bracket, except for a factor of 271 
becasue one has D, instead of 0,. In other words: 


(8.39) Corollary. 


If P € Y™(Q) and Q € W™'(Q) are properly supported then [P,Q] € 
wmtm'-1(9) and 


1 te 
oP,9] = 5{oP.7q} (mod s™t™ -2(9)). 


Ifp € S™(Q) andge s™'(Q) and p(z, D) and q(z,D) are properly sup- 
ported then 


[p(z, D), q(x, D)| = 2ni{p,q}(z, D) (mod ¥™+™'-2(Q2)). 
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The symbol-operator correspondence is related to the problem of defin- 
ing a correspondence between observables in classical mechanics and ob- 
servables in quantum mechanics. For the latter purpose one should insert 
factors of fi (Planck’s constant) in various places in our formulas, and the 
asymptotic expansions of the theorems in this section then become expan- 
sions in powers of fi. The correspondence between Poisson brackets and 
commutators is of particular importance in this setting. See Folland [15]. 


EXERCISES 


1. Show that if u € 5S, there is a sequence of finite sums of the form 
Sn(z) = So cM et *#& G(EN) such that O°Sy — d%u uniformly on 
compact sets for every a. 


2. Our hypotheses about proper support are sometimes more stringent 
than necessary. For example, the product of two WDO is well-defined 
if at least one of them is properly supported. Formulate and prove ver- 
sions of Theorem (8.37) and Corollaries (8.38) and (8.39) (as corollaries 
of these results themselves) that apply in this more general situation. 


3. Since 0 = 27D, the asymptotic formula for ¢p- in Theorem (8.36) can 
be written formally as 


op-(a,€) ~ 2D Deg (z,€). 


Another interpretation of e?'2=)D¢ is available, as an operator on 
8(IR2") defined by the Fourier transform: 


[e2**P= De Fr(n, y)= e219 Fn, y). 


Show that if P = p(x, D) where p € S(R®") (so P € ¥~™(R*)), then 
ops = e?"iD=-Dey (exact equality!). (Hint: Use (8.6).) 


4. (A product rule for YDO) Suppose p € S™(Q) and v € C™(Q). Show 
that for every positive integer k there exists Ry € wm-k-1(0) such 
that 

1 
p(z, D)(vu) = y sip (Of p)(z, Du + Ryu. 
lolsk 

In certain cases one can obtain an exact formula for the product of two 

WDO, with no error term. The following exercises examine three such 

cases; in all of them, one should work directly with the definition (8.3) 

rather than trying to apply the results of this section. 
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5. Suppose p,q,r € S”(Q), p is independent of €, and r is independent of 
z. What are p(z, D) and r(z, D) in these cases? Show that (pq)(z, D) = 
p(z, D)q(z, D) and that (qr)(z, D) = q(x, D)r(z, D). 


6. Show that the result of Exercise 4 holds with R,; = 0 if v is a polynomial 
of degree < k. 


7. Show that if p € S™(Q), 


D*[p(2, Du] = Sy sea(DEp\(2, DD" a}, 
B+y=a 


E. Sobolev Estimates 


We now state and prove a continuity theorem for pseudodifferential opera- 
tors with respect to Sobolev norms. Our argument is an elaboration of the 
one we used to prove Proposition (6.12). 


(8.40) Theorem. 

Suppose P € ¥™(Q). 

a. P maps H°(Q) continuously into H'°*,(Q) for all s € R; that is, if 
@ €CY(Q) then ||dPull,-m < Cs,gllulls for all u € H2(Q). 

b. If P is also properly supported, P maps H!°°(Q) continuously into 
Hiec (Q) for all s € R; that is, for every ¢ € C2°(Q) there is ap € 
C&(2) such that ||¢Pul]s;-em < C¢,s|lvulls- 


Proof: Let P = p(z,D). Then the map u — ¢Pu is q(x, D) where 
q(z,€) = $(z)p(z,€). To prove (a), it therefore suffices to show that if 
q(z,€) € S™(Q) and q(z,&) = 0 for x outside some fixed compact set B, 
then Q = q(z, D) is bounded from H2(Q) to Hs-m- 

Suppose then that u € H2(Q). Qu is defined by the recipe for the 
action of Q on distributions in §8A, and Qu has compact support — in 
fact, supp Qu C B. It follows that 


Gala) = (Que MO) = ff err g(z, g)a(6) de de 
= | auln—¢, e016) ae, 
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where 9; denotes the Fouricr transform of gq in its first variable. Hence, if 
veé5, 


(Qu|») = (Quls) = i [Ken 940TH 6 dn 


where 


FE) = (1+ (EPG), gn) = (1+ Ln?) Hm), 
K(9,€) = G(n— € E01 + (EP?) 721 + (nO? 


Since q(z,€) has compact support in x, for any multi-index a we have 


"ace = 


/ [Dae 2" “a(x, €) dz 


< Ca(1 + (é1?)"/?, 


i} en2rie Dogz, €) dz 


so that for any positive integer NV, 
au, €)1 S Cw (1 + (67/71 + 1C/?)-%. 
Hence, by Lemma (6.10), 


|K(n.€)] < Cu (1 + €[?)0"- 9/7 (1 + [nl?) O72. + | - €/?)-% 
< Cy(1t In — EP )-N tml? 


If we choose N > $n + |s — ml, we see that 


Jixaoinse, — [iK@eld sc. 
Therefore, by (0.10) and the Schwarz inequality, 


KQu]v)| < Callflizellgllz2 = Csllullsilellm—s, 


so the duality of H,-m and Hm_, implies that ||Qul|,-m < Cs|lull., as 
desired. This proves (a). 

Now suppose P is properly supported, u € H!°(Q), and ¢ € C%(Q). 
By Proposition (8.12) there is a compact B C Q such that the values 
of Pu on supp ¢ depend only on the values of u on B. Thus if we pick 
wv € C(O) with » = 1 on B, we have $Pu = ¢P(pu). By part (a), then, 
(|¢Pulfsem < C,||Wul];, which proves (b). | 
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EXERCISES 


1. Assume 2 = R". Find sufficient conditions on p € S™(R") for p(x, D) 
to be bounded from H, to Hs—m (globally, with no cutoff functions). 


2. The boundedness of the function p is not necessary for the boundedness 
of p(z, D) on Ho = L?. Show, for example, that if p € L?(Q x R") then 
p(z, D) (defined by (8.3), even though p may not belong to S*(Q)) is 
a compact operator on L?(Q). (Hint: Theorem (0.45).) 


F. Elliptic Operators 


A symbol p € S™(Q), or its corresponding operator p(z,D) € ¥™(Q), is 
said to be elliptic of order m if for every compact A C Q there are 
positive constants c4,C, such that 


Ip(z, €)| > cal€l™ for x € A and |é| > Ca. 


This agrees with our previous definition when p(z, D) is a differential op- 
erator; see the remarks at the beginning of §6C. As we did with differential 
operators, when we say that P € ¥™(Q) is elliptic, we shall always mean 
that it is elliptic of order m. 

In this section we shall show that elliptic pseudodifferential operators 
are invertible in the algebra ¥™(Q)/Y~-™(Q); this will yield an easy proof 
of the elliptic regularity theorem for pseudodifferential operators and a 
proof that elliptic differential operators are locally solvable. We shall also 
derive a version of Garding’s inequality for pseudodifferential operators. To 
begin with, we state the following technical lemma, whose proof we leave 
to the reader (Exercise 1). 


(8.41) Lemma. 

If p € S™(Q) is elliptic, there exists ¢ € C™(Q x R") with the following 
property: For any compact A C Q there are positive constants c,C' such 
that for z € A we have 

a. ¢(2,£) = 1 when |é| > C; 

b. |p(z,€)| 2 clE/™ when C(x, £) # 0. 


If L is a differential operator on Q, a left (resp. right) parametrix for 
L is usually defined to be an operator T (not necessarily a YDQ), defined 
on some suitable space of functions or distributions on Q, such that TL —TJ 
(resp. LT — I) is a smoothing operator. In our context, we shall define 
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a parametrix for a YDO P € ¥©°(Q) to be a properly supported WDO 
Q € ¥(2) such that PQ—I € Y~™/(Q) and QP—I € W-™(Q). (Here and 
in the following discussion, we may modify any ¥DO by adding an element 
of Y-™(Q) to make it properly supported, as the need arises. This has no 
effect on our calculations, which are all performed modulo ¥~™(Q).) 


(8.42) Theorem. 
If P € W™(Q) is elliptic, P has a parametrix Q € ¥~™(Q). 


Proof: Let P = p(z, D), and let ¢ be as in Lemma (8.41). Let go = 
¢/p, with the understanding that go = 0 wherever ¢ = 0. Since go(z,£) = 
1/p(z,€) for large € and p is elliptic, it is easily verified (Exercise 2) that 
qo € S~™(Q); moreover, ggp—1 has compact support in € and hence belongs 
to S~™(Q). Let Qo = go(z, D); then by Corollary (8.38), 


SQoP = Gop (mod S~'()) 
= 1— 1; where ry € S71(Q). 
Let qi = Cri/p = rigo € S~™73(Q) and Qi = qi(z,D). By Corollary 
(8.38) again, 
vg.p =p (mod S~?(2)) 
= 11 ~ 12 where rz € S~7(Q), 


and hence 
F(Q04Q1)P = %QoP +oQ.P = l—ro, r2€ S-2(Q). 


We now construct q; inductively for 7 > 2. Having constructed q; € 
S~™-3(Q) for j < k so that (with Q; = q;(z, D)) 


(Qo+-4Q,)P=1—rj41, 41 E S7I7*(Q), 
we set qx = Cre/p = reqo € S~™-*(Q) and Qy = qk(z, D). Then 


eq.P =p (mod S~*-1(Q)) 


=Te—Tk4ei where Tei € SoRN(O), 


and hence 


O(Qo+ -+Qu)Pp = (L— re) + (re — ret) =1—regs, tees € S7*77(Q). 
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Now, by Theorem (8.16) and Corollary (8.32), there is a properly sup- 
ported Q = q(x, D) such that q¢ ~ 57>° qj, and for any k we have 


Cap —1= 4(Qo4-- +Q1)P —1 (mod S~*-*(Q)) 
=0 (mod S-*-1(Q)), 


so QP —I € ¥~-™(Q). In exactly the same way, we can construct Q! € 
W-™(Q) such that PQ’ —I € ¥-™(Q). But then 


PQ- I= (PQ’-1)+ P(QP— DQ — PQ(PQ' ~ 1) € H™(Q), 
so Q is a parametrix for P. i] 


As an immediate corollary, we obtain a generalization of the elliptic 
regularity theorem (6.33) to pseudodifferential operators. We shall derive 
a further refinement of this result in §8G. 


(8.43) Theorem. 

Suppose P € ¥™(Q) is elliptic and properly supported, and u € D/(Q). 
If Pu € H'*(Q), then u € HIS,,(Q). In particular, if Pu € C°(Q) then 
uEec~m(O). 


Remark: The hypothesis of proper support can be dropped if one 
assumes u € €'(Q). 


Proof: Let Q € Y~™(Q) be a parametrix for P. We have QPu € 
H's¢,,(Q) by Theorem (8.40), and (I - QP)u € C™(Q) since I- QP € 
W-°(0). Hence u = QPu+(I—-QP)ue H's, (2). The second assertion 
follows from Corollary (6.7). a 


We now derive the local solvability theorem for elliptic operators. First, 
a technical lemma. 


(8.44) Lemma. 

Suppose X is an M-dimensional subspace of C2°(IR") (0 < M < 00), and 

zo ER”. Fore > 0, let W, = R" \ B,(z0). 

a. There exists « > 0 such that the restriction map h — h|W, is injective. 

b. Let € be as in (a). For any f € €'(IR") there exists 9 € C&°(W,) such 
that (f —g, h} = 0 forallhe X. 
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Proof: Let hy,...,h be a basis for X. If (a) is false, for each k > 


1 there are constants b&,...,bk, such that max, [b*,| = 1 and >> bE hy» 
is supported in B,/;(t0). By passing to a subsequence we may assume 
that limyoo b¥, = bm exists for each m. But then maxm |bm| = 1 and 


>> bmhm = 0 (since supp(S> bnhm) C {z0}), which is impossible since the 
hm’s are linearly independent. 

To prove (b), let ¥ = {A|W, : h € X} with c asin (a). Then Y is a Hilbert 
subspace of L?(W,) since X C C&%° and dimY < oo, and {hm|W.}™_, is a 
basis for Y. The elements of the dual basis for Y can be approximated in 
L?(W,.) by functions in C°°(W,); hence, there exist gi,..., gu € C?(W.) 
such that the matrix ((g:,4m)) is nonsingular. Given f € €’, then, there 
are unique constants c,,...,cyg such that )>) c:(g1, 4m) = (f, Am) for m= 
1,...,.M, so we can take g = )> mgm. 1 


(8.45) Theorem. 

Suppose P is an elliptic differential operator with C™ coefficients on 2. 
Every x9 € Q has a neighborhood U C Q such that the equation Pu = f 
has a solution on U for every f € D'(Q). 


Proof: Let W be an open set such that 79 EW CWCQ and W is 
compact, and pick @ € C&°(Q) with @ = 1 on W. We first observe that it 
suffices to consider f € €/(Q2), for if f € D/(Q), a solution to Pu = of on 
a neighborhood U of zo with U C W also satisfies Pu = f on U. 

Let Q be a parametrix for P, and let S = PQ—J. Since P and Q 
are properly supported, so is S; hence, by Proposition (8.12) there is a 
compact A C Q such that the values of Su on W depend only on the 
values of u on A. Pick » € C2°(Q) such that y = 1 on A, and define 
the operator T by Tv = ¢S(yv); then T € W~™(R") (it is given by the 
amplitude a(z, €,y) = ¢(z)o5(z, €)¥(y)) and Tv = Sv on W for any v. If 
for any f € €/(IR") we can find v € D’(R”) such that (T+ J)v = f ona 
neighborhood U C W of zo, we are done. Indeed, let v = v|Q; then on U 
we have PQU = (S+1)¥ =(T + I)v = f, so u = QU does the job. 

Suppose then that f € &/(R"). By Corollary (6.8), f € Hs for some 
s ER. The range of T consists of C® functions supported in supp ¢, so by 
Theorem (8.40) and Rellich’s theorem (6.14), T is compact as an operator 
on H,. By Corollary (0.42), the equation (T+J)v = f has asolution v € H, 
provided that f is orthogonal in H, to the space N= {g: (T7 + I)g = 0}, 
where T; denotes the adjoint of T with respect to the inner product on 
Hy. 


Pseudodifferential Operators 301 


Now, for g € N we have g = —Tyg, so 
~(fla)s = (F1T79)s = (TF lg). = (A’T SIA’ g)o = (Ff, T*A2*9), 
where A® is defined by (6.4) and 7™ is the adjoint of T as defined in §8D. 
Thus f 1 N if and only if (f,4) = 0 for all A in the space X = {T*A?%g : 
9 € N}. Note that X C C® since T*v = S* (dv) and S* is smoothing. 

In short, there exists vu € H, such that (T + I)v = f if and only if 
(f,h) = 0 for all A in X, a finite-dimensional subspace of C°. This may 
not be the case, but by Lemma (8.44), for some € > 0 there is always a 
g € C& with support disjoint from B,(x9) such that (f — g, h) = 0 for all 
h € X, and then there exists v € H, such that (T+ J)v = f ~g. But then 
(T + I)v = f on B,(zo), so we can take U = B,(zo) NW. | 


We now turn to Garding’s inequality for pseudodifferential operators. 
We shall say that a symbol p € S™(Q), or its corresponding operator 
p(z, D), is strongly elliptic if for every compact A C 2 there are positive 
constants c, C’ such that 


(8.46) Re p(z, ) > c(1 + |é|?)"/? for z € A and |é| > C. 


(This definition differs slightly from the one in §7A when p(z, D) is a differ- 
ential operator; the set we called © there corresponds to the A here. The 
possible inclusion of lower order terms in p and the replacement of |€|™ by 
(1 + [€]?)™/? are of no consequence since (8.46) is to hold only for large 


IéI-) 


(8.47) Theorem. 
Suppose P € ¥™(Q) is strongly elliptic, and let Po = $(P + P*). There 
exists a properly supported Q € ¥™/?(Q) such that Py — Q*Q € W-™(Q). 


Proof: The argument is very similar to the proof of Theorem (8.42). 
Let P = p(z, D) where p satisfies (8.46), let ¢ be as in Lemma (8.41) with 
p replaced by Rep (possible since Rep is elliptic), and let 


qo(z,€) = C(z,€)[Rep(z,é)}'/7, q(x, €) = C(x, €)[Rep(z, €)]71”. 


It is easy to verify (Exercise 4) that go € s™/2(Q) and g® € S-™/2(Q), 
and goqg° — 1 is in S~™(Q) since it has compact support in €. Moreover, 
by Corollary (8.38), 


FPo~Q3Q0 = Rep a“ % (mod s™-1(9)) 
rp €S™1(Q). 
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Procceding inductively, for j > 1 we can find real qg; € S(™/2)-3(Q) such 
that if Q; = q;(z, D), 
Po ~ (Qo +++ Qi-1)(Qo +--+ + Qe-1) = re(z,D), re E S™*(Q). 
Indeed, if we have found q;,..-,qe¢-1, We observe that r.(z, D)* = rz(x, D) 
and hence, by Corollary (8.38), re — 7% € S™-*-1(Q). Thus, if qx € 
S(m/2)-k(Q) is real and Qy = qz(z, D), Corollary (8.38) gives 

FPo-(Q5t - +23 Qot+ Qe) 

= re — 2qe(got +++ +4%-1)—G¢ (mod S™-*-1(0)) 

= Rerz —2qoqe (mod S™-*-1(Q)). 
Hence we can take q, = 4q°(Re Tk). 

Finally, by Theorem (8.16) and Corollary (8.32), there is a properly 


supported Q = q(z, D) € ¥™/?(Q) such that g ~ )7§° qj. Then Po—Q*Q € 
W~-(Q), and we are done. 1 


(8.48) Garding’s Inequality. 
Suppose p € S™(Q) satisfies (8.46) and W is an open set with compact 
closure in Q; let ¢ be the constant cy in (8.46). For any « > 0, any 


s< im, and any open set V with V C W there is a constant C > 0 such 
that 


Re(p(z, D)ulu) > (c—e)llulliay2 — Cllulls, = we CCV). 


Proof: Replacing 2 by W, we may assume that Re p(z,&) > clé|™ for 
all z € Q and |¢| sufficiently large. Let p(x, €) = p(z,€)—(c—e)(1+ [é[?)™/?. 
Then p € S™(Q) and 7 satisfies (8.46) with cq = € for all A, so we can 
apply Theorem (8.47) to P = p(x, D) to obtain Q € y7/2(Q) with R= 
Po — Q*Q € ¥~~(Q). On the other hand, P = p(z, D) — (ce — e)A™ where 
A™ is defined by (6.4), so if u € Co°(V), 

Re(p(z, D)u, |u) = Re(Pu|u) + (c— €)(A™u| u) 
= (Pouju) +(c—)(A™/2u[ A™?2u) 
= (Q*Quju) + (Ru|u) +(c—e)llulli- 
But (Ru|u) = (¢Ru|u) whenever ¢ € CH°(Q) and ¢ = 1 on V, so by 
Theorem (8.40), 
|(Ru|u)| < ||@Rull-.llulls < Cllulls. 
Also, (Q*Qu|u) = (Qu| Qu) > 0, so 
Re(p(z, D)u|u) > —|(Ru|u)|+(c—e)flullnaye = (e—e)llulliny2 —Cllulls. @ 
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The original Garding inequality (7.15) follows immediately by taking 
the m and QQ in (7.15) to be 2m and V in (8.48), for the estimate in (8.48) 
for u € C&(V) clearly extends to u € AD 2(V)- 

Garding’s inequality implies that if P = p(z, D) where p satisfies the 
strong positivity condition p(z,&) > el|é|", the symmetrized operator Po = 
i(P + P*) is a positive operator modulo a lower order error term. One 
may ask if a similar conclusion holds under the weaker hypothesis that 
p(x,é) > 0. That is, is it true that 


(8.49) p20 => Re(p(z,D)ulu)>—Cljull? (we CX) 

for some s < 4m? If this were true, we could take « = 0 in (8.48) — 
just apply (8.49) to p(z,é) = Re p(z, €) — c(1 + |é|?)™/?. For this reason, 
an inequality of the type (8.49) with s < im is called a “sharp Garding 
inequality”. 

The arguments above are insufficient to yield such a result. (The proof 
of Theorem (8.47) breaks down if we merely assume that p > 0 because the 
square root function is not differentiable at the origin.) Moreover, simple 
examples (see Exercise 5) show that one should not hope for (8.49) to hold 
for s < }m—1. In fact, (8.49) with s = 3m-— 4} was proved by Hérmander 
and extended by Lax and Nirenberg to YDO acting on vector-valued func- 
tions; this is what is usually called the sharp Garding inequality. For 
four different proofs of this result and a number of important applications, 
see Folland [15], Hormander (27, vol. III], Taylor [48], and Treves [53, vol. I]. 
Moreover, Fefferman and Phong have proved the much deeper result that in 
the scalar case, (8.49) holds with s = im— 1; see Hormander (27, vol. III]. 


EXERCISES 
1. Prove Lemma (8.41). (Hint: Use the ideas in Exercise 5, §8B.) 


2. Show that if p € S™(Q) is elliptic and ¢ is as in Lemma (8.41), then 
¢/p € S-™(Q). 


3. Show that if P € ¥Y™(Q) has a parametrix Q € ¥~™(Q), then P and 
Q are elliptic. 


4. Show that if p € S™(Q) is real and strongly elliptic and ¢ is as in 
Lemma (8.41), then ¢p!/? € S™/?(Q) and Cp7!/? € S-™/2(Q), 


5. Let p € S?(IR) be given by p(z,£) = 41727€?, so that p(z, D)u(z) = 


~—z7u" (2). 
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a. Show that for u € CS(R), 


Re(p(z, D)u|u) = five) + du(z)|? dz — ; } |u(x)|? da. 


b. Show that for any ¢ > 0 there exists u € CO°(IR) such that 
if jxu'(x) + $u(z)|? dz < ef ju(x)|? dz. 


Hint: Pick ¢ € C&°(R) with ¢(z) = 1 for z € [4,2] and supp¢ C 
[5,4], and take u(x) = 2~'/?(2*) where 6 is a small constant times 
€. (The motivation is that f(x) = 2~1/? satisfies zf'+1f = 0, and 
u is a truncated version of f.) 

c. Conclude that the estimate Re(p(z, D)u|u) > —3{|ull2 is sharp. 


G. Introduction to Microlocal Analysis 


One of the principal applications of pseudodifferential operators is in mak- 
ing a detailed analysis of singularities of functions and distributions, in 
which one considers not only the places where a distribution is singular 
but the directions in which it is singular. (For example, f(x,y) = [y| is 
perfectly smooth in the z direction but is not C? in the y direction along 
the line y = 0.) This idea of examining things locally both in position 
space and in direction space goes under the name of microlocal analy- 
sis. In this section we give an introduction to some of the basic ideas and 
techniques of microlocal analysis — in particular, the notion of the wave 
front set of a distribution. 

Roughly speaking, the wave front set of a distribution u is the set of all 
(2, €) such that u fails to be smooth at z in the direction €. To make this 
more precise, suppose for the moment that u has compact support, so @ isa 
smooth function. We take the statement “u is smooth in the direction &,” 
where 5 # 0 € R", to mean that Uu(t€) is rapidly decreasing as t + +00 
for all € in some neighborhood of 9. For a general distribution u, then, we 
take the statement “u is smooth at zo in the direction &)” to mean that 
for some ¢ € Cf° with ¢ = 1 near zo, du is smooth in the direction £9. 
That these notions are reasonable may be seen by considering some simple 
examples; see Exercise 3. 

For reasons of efficiency, we shall proceed somewhat differently. We be- 
gin by introducing some terminology and lemmas concerning the microlocal 
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behavior of pseudodifferential operators. We then define wave front sets in 
terms of pseudodifferential operators and prove a number of results con- 
cerning them, and and finally we derive the characterization sketched above 
as a theorem. In this section, the hypothesis that a pseudodifferential op- 
erator is properly supported will be assumed wherever it is convenient, 
without explicit comment. 

If Q is an open subset of R", we shall set 


TN=Nx {E€ER":EZ40},  TN=OQx {€ ER": E[= 1}. 


(In coordinate-invariant terms, T°Q is the cotangent bundle of Q with 
the zero section removed, and T'Q is the co-unit sphere bundle. See the 
discussion at the end of §8H.) A subset U of R”\ {0} is called conic if 
1€ €U for all t > 0 whenever € € U, and a subset V of T°Q is called conic 
if (z,t€) € V for all t > 0 whenever (z,é) € V. Note that a conic set in 
T°Q is completely determined by its intersection with T!Q. 

A pseudodifferential operator P = p(x, D) € ¥™(Q) is called elliptic 
(of order m) at (zo, 0) € J°Q if there are positive constants c,C and a 
conic neighborhood V of (zo, 9) such that 


Ip(z,€)| 2 elé|™ for all (x, €) € V with |é| > C. 


The characteristic variety of P, denoted by char P, is the complement 
of the set on which P is elliptic: 


char P = {(z,€) € T°: P is not elliptic at (x, £)}. 


At the other end of the scale, P = p(z,D) is called smoothing at 
(zo, €0) if p(x, €) is rapidly decreasing on a conic neighborhood of (zo, £0), 
that is, if there is a conic neighborhood V of (zo, 0) such that 


|DE Dg p(z,€)| $ Cata,a(1 + |é|)~™ for all M > 0, (2,€) € V. 


The microsupport of P, denoted by psupp P, is the complement of the 
set on which P is smoothing: 


psupp P = {(z,€) €T°Q : P is not smoothing at(z,€)}. 


char P and psupp P are both closed conic sets. Their union is always 
T°Q since P cannot be smoothing and elliptic at the same point, but in 
general they have a large overlap. It is easy to see that this definition of 
char P agrees with our previous one when P is a differential operator; on 
the other hand, ysupp P is of no interest when P is a differential operator. 
See Exercise 1. Incidentally, we follow Treves [53] in using the term “mi- 
crosupport.” psupp P is called the “essential support” of P in Taylor [48] 
and the “wave front set” of P in Hérmander [27]. 
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(8.50) Proposition. 
For any P,Q € ¥™(Q), 


psupp(PQ) C (usupp P) NM (usupp Q). 


Proof: It follows easily from Theorem (8.37) that opg is rapidly de- 
creasing on any conic set where either op or cq is. 1 


(8.51) Proposition. 
psupp P = @ if and only if P € ¥-™(Q). 


Proof: Suppose psupp P = @. If A C 2 is compact, each (z,€) € 
A x $,(0) has a conic neighborhood on which the symbol of P satisfies 
estimates of the form |D8 Dgep(z, €)I < C14 |él)7”%. A x 5,(0) can 
be covered by finitely many such neighborhoods, and it follows that op 
satisfies such estimates globally on T°A; thus P € V-™(Q). The converse 
is obvious. i] 


(8.52) Proposition. 

Suppose V is a conic neighborhood of (z0,€) € T°. There exists p € 
S5°(Q) such that p > 0, suppp C V and p(z,€) = 1 for all (x,£) in some 
conic neighborhood of (zo, 9) with |€| > 1. In particular, p(x, D) is elliptic 
at (to,€0) and psupp p(x, D) CV. 


Proof: Since V is conic, we may as well assume that |f| = 1. Let A 
and B be compact neighborhoods of zo in 2 and &o in S,(0) = {€ : |€| = 1} 
such that Ax B C VNT'Q. Pick ¢ > 0 in C&(Q) with suppd C A 
and ¢ = 1 on a neighborhood of zo, and pick » > 0 in C™(5,(0)) with 
supp? C B and p = 1 on a neighborhood of &. Further, pick ¢ > 0 in 
C™(R) such that ¢(t) = 0 for t < $ and ¢(¢) = 1 for t > 1. Then set 


p(2,€) = o(x)(lél-*€)¢ (IE). 


p © S°(Q) because it is homogeneous of degree 0 for large € (as defined in 
§8A) and C@™ for small € (by virtue of the factor of ¢). 1 


We are now ready to define the wave front set of a distribution u € 
D'(Q): 
WF(u)= ( \{char P :P € W°(Q) and Pue C™}. 


W F(u) is a closed conic subset of T°. The intuition behind it is a microlo- 
cal version of the elliptic regularity theorem which we shall prove below: If 
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P is elliptic at (29, >) and Pu is smooth then u must be smooth at (zo, £0); 
hence WF(u) consists of the points zo and directions £9 where u fails to 
be C™. 

Note. In the definition of WF(u), char P is the characteristic variety of 
P as an operator of order zero; if P € ¥Y™(2Q) for some m < 0 then char P = 
T°Q, so only operators of true order zero are significant. Moreover, the 
condition P € Y°(Q) in the definition of WF(u) is merely a convenient 
normalization; we could equally well use WDO of other orders. Indeed, 
if P € ¥™(Q), choose an elliptic Q € Y~™(Q) (for example, Q = A~™ 
as defined by (6.4) or a properly supported variant thereof); then QP € 
W°(Q), and P is elliptic at (xo, €o) (as an operator of order m) if and only 
if QP is elliptic at (zo, 0) (as an operator of order 0), by Theorem (8.37). 

If our definition of WF(u) is to do the job it was designed for, the 
projection of WF(u) onto 2 should be the set of points at which uw is not 
C@™, namely, the singular support of u. We now show that this is the case. 


(8.53) Theorem. 
Let x: T°Q — Q be defined by m(z,€) = x. Ifue€ D'(Q), then 7[WF(u)] = 
sing supp wu. 


Proof: Ifzo ¢ sing supp u, there exists ¢ € Co°(Q) such that ¢(zo) = 
l and ¢du € C™”. The operator Pv = ¢v is p(x, D) where p(z,£) = ¢(z); 
hence P € ¥°((2) and P is elliptic near zo. In other words, Pu € C® and 
(to,€) ¢ char P for all € # 0, so zo ¢ [WF (u)]. 

Conversely, suppose zo ¢ t[WF'(u)]. For each unit vector € there exists 
P € ¥°(Q) that is elliptic on a conic neighborhood of (zo, €) such that Pu € 
C®, By compactness of the unit sphere, we can find Py,... Py € ¥°(Q) 
such that (i) P;u € C™ for all j and (ii) for each € # 0 there is a j such 
that P; is elliptic on a conic neighborhood of (zo, £). Let P = wa P? Pj. 
Then P is elliptic on a neighborhood V of zo in 2 because op = a lop, |? 
(mod S~!(Q)), and Pu € C®. By Theorem (8.43) (with © replaced by 
V), u€ C™(V), so zo ¢ sing supp u. i] 


If P is a WDO, we showed in Theorem (8.9) that sing supp Pu C 
sing supp u for any distribution u. We now derive the microlocal refine- 
ment of this result: 


(8.54) Theorem. 
If P € ¥™(Q) and u € D'(Q) then 


W F(Pu) C WF(u) Npsupp P. 
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Proof: If (zo,£0) ¢ usupp P, by Proposition (8.52) we can find Q € 
W°(Q) such that Q is elliptic at (zo, 0) and psupp PO psupp Q = @. Then 
QP € ¥-~(Q) by Propositions (8.50) and (8.51), so QPu € C™. But this 
means that (r0,£0) € WF(Pu). 

On the other hand, suppose (zo,fo) ¢ WF(u), so there exists Q € 
W°(Q) such that Qu € C™ and Q is elliptic on a conic neighborhood V 
of &. We may assume that Reag > C > 0 for large € in some conic 
neighborhood of (zo, £0). (Replace Q by Q*Q if necessary; this works since 
gq = |og|? (mod S~*(Q)) by Corollary (8.38).) 

Claim: There exist R,S € W°(Q) such that (zo, o) ¢ char R and RP — 
SQ & ¥-™(Q). 

Granting the claim, we see that SQu € C™ since Qu € C™ and (RP— 
SQ)u € C™ since RP — SQ is smoothing, so RPu € C™; hence (zo, £0) ¢ 
WF(Pu). 

To prove the claim, pick an elliptic operator Qe W°(Q) such that 
5 = og ona conic neighborhood U of (zo,60). (The assumption that 
Reog > C near (zo, ) makes this easy: take 05 =oQt+ 1 — p where 
p is as in Proposition (8.52).) Then, by Proposition (8.52) again, pick 
R € W°(Q) with (9,0) ¢ char R and supp R CU. Finally, let T be a 
parametrix for Q and set S = RPT. Since psupp RN psupp(Q -Q)=2, 
we have RPT(Q — Q) € ¥~(Q) by Propositions (8.50) and (8.51), so 


RP — SQ = RP(I— TQ) + RPT(Q- Q) € ¥-*(Q), 
and the claim is proved. I 
As an immediate corollary, we obtain the microlocal refinement of the 


C@™ regularity theorem for elliptic operators. (For the extension of this 
result to Sobolev spaces, see Exercise 2.) 


(8.55) Corollary. 
If P € ¥™(Q) is elliptic, then WF(Pu) = WF(u) for any u € D'(Q). 


Proof: Let Q be a parametrix for P; then QPu—u € C™, so 
WF(u) = WF(QPu). By Theorem (8.54), then, WF(Pu) C WF(u) 
and WF(u) = WF(QPu) C WF(Pu). L] 


Finally, we derive the more intuitive characterization of wave front sets 
alluded to at the beginning of this section. 
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(8.56) Theorem. 
Suppose u € D‘(Q). Then (20,60) ¢ WF(u) if and only if there exist 
@ € C%(Q) such that (zo) = 1 and a conic neighborhood V of € in 
R" \ {0} such that 


sup + |€1)” |(¢uy(E)| < 00 for all M > 0. 
€ 


Proof: Ifsuch a ¢ and V exist, choose p € C™(R") such that supp p C 
V, p = lonaconic neighborhood of £9, and p € S°(Q) as a function of (2, £) 
that is independent of z. (Cf. the proof of Proposition (8.52).) Then p(@u)y- 
is rapidly decaying at infinity, so its inverse Fourier transform p(D)[du] is 
C®. But the operator Pv = p(D){¢v] is a VDO of order zero whose symbol 
equals ¢(x)p(€) (mod S~1(Q)), so (zo,€0) ¢ char P and Pu é€ C™. Thus 
(20,0) ¢ WF(u). 

Conversely, suppose (to,£0) ¢ WF(u). There is a neighborhood N of 
zo in such that (z, fo) ¢ WF(u) for all z € N. Pick ¢ € C%(N) with 
¢(zo) = 1, and let 


x= {€:(2,é) € WF (du) for some z}. 


Since WF(¢u) C WF(u) Nsupp ¢ (this is a simple special case of Theo- 
rem (8.54)), 0 ¢ X. Moreover, E is a closed conic set, for if £; € © for 
j > land &; — €, there exist z;, necessarily in the compact set supp ¢, 
such that (2;,é;) € WF (¢u); by passing to a subsequence we can assume 
that z; — 2x, so (x,€) € WF(¢u) and hence € € X. Hence, as in the 
preceding paragraph, we can find p(é) € S°(Q) such that p= 1 on a conic 
neighborhood of &) and p = 0 on a conic neighborhood of &. But then 
psupp p(D) NWF (¢u) = @, so p(D)[¢u] € C™ by Theorems (8.54) and 
(8.53). 

Moreover, by Theorem (8.8a), the inverse Fourier transform p” of p 
agrees with a Schwartz class function outside a neighborhood of the origin, 
and the estimate (0.32) then implies that p(D)[¢u] = (¢u) * pY agrees 
with a Schwartz class function away from supp ¢. Combining this with the 
fact that p(D)[¢u] € C™, we see that p(D)[¢u] € S$. But then p(¢duy = 
(p(D)[¢u]Y € §, so (guy is rapidly decreasing on the set where p = 1. 
Since this is a conic neighborhood of £ , we are done. | 


EXERCISES 
1. Let P = p(z, D) be a differential operator on 2. 
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a. Show that the definition of char P in this section agrees with the 
one in §1A. 

b, Show that psupp P = T°Q unless there is a nonempty open U C2 
such that p(z,é) = 0 forz EU. 

2. Suppose u € D/(M) and s € R. If (z0,€0) € T°, we say that 
u € H,(zo, 0) if there exist ¢ € CO°(Q) with ¢(zo) = 1 and a conic 
neighborhood V of > in R"\ {0} such that (1+ l€|?)*/2$u(€) is square- 
integrable on V. 

a. Show that u € H,(ao, £0) if and only if u = v + w where v € H9(Q) 
and (20, €0) ¢ WF(w). 

b. Suppose P € ¥™(Q) is elliptic of order m. Show that u € H;(zo, £0) 
if and only if Pu € Hs—m(o, &0).- 

3. Let n= k+m, and regard R" as R* x R™ with coordinates (z, y) and 
dual coordinates (€,77). Let x, be Lebesgue measure on R*, regarded 
as a distribution on R® ((yx,¢) = f (2,0) dz). 

a. Show that if 6 € C2, (dz) (E,n) = (£). 
b. Conclude from Theorem (8.56) that if f Ee C™, 


WF (fue) = {((2,0), (0,9): 2 € supp f, 7 £ Of. 

c. More generally, suppose u € D’(IR*), and let s(u) be the injection 
of u into R" ((i(u), 6) = (u, g[IR*)). Show that WF(z(u)) is the set 
of all ((z,0), (€,17)) with z € supp u, (z,£) € WF(u), and 7 £0. 

4. Define u € D’(IR) by u = 716 — P.V.(1/z), that is, 
(u, ¢) = wid(0) — lim ot) dz. 
er |z|>¢ z 


Show that WF(u) = {(0,£) :€ > 0}. Hint: Use Exercise 1, §5F. 


H. Change of Coordinates 


In this section we shall show how pseudodifferential operators behave under 
smooth changes of coordinates. Here is the setup: We suppose that 2 and 
! are open sets in R® and that F : 2! — Q is a diffeomorphism, that is, a 
C®@ bijection with a C™ inverse. We denote the Jacobian matrix of F by 
Jp (i.e, Jp = (Oy; /Oz,) where y = F(z)) and its adjoint (transpose) by 
Jp. if P is a properly supported YDO on Q, we transfer P to an operator 
on functions on 12’ by setting 


PFu=[P(uo F7})o F, 


and our aim is to show that P¥ is a VWDO on 1’. 
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(8.57) Lemma. 

With notation as above, there exist a neighborhood N of the diagonal Agy 
and a C™® map p from N to the set of invertible n x n matrices such that 
p(2,2) = Jp(2) for all z and 


F(z) — F(y) = w(z,y)(2 — y), (z,y) EN. 


Proof: The set No of all (z, y) € 2’ x ' such that the line segment 
from x to y is contained in 2! is a neighborhood of Aqy. If (z,y) € No, by 
the chain rule we have 


ld 
F(2)— FW) = ff) GP(te + (1 )w) de = le, Me - v) 
where : 
ule») = [ Ir (tz + (1—t)y) dt. 


Since Jr(tz + (1 — t)y) is close to Jr(z) when y is close to z, and Jr(x) 
is invertible for all 2, there is a neighborhood N C No of Am on which 
(x,y) is invertible, so we are done. 1 


(8.58) Theorem. 

Suppose F ; Q! — QQ is a diffeomorphism, P € ¥™(Q) is properly sup- 
ported, and PF : C9°(9') + C™(9’) is defined by PF u = [P(uo F~!)joF. 
Then PF € ¥™(Q'), and 


opr(2,€) = op(F(a), [Jp(z)|-'€) (mod S™~*(')). 


Proof: First suppose that m < —n. Let N and y be as in Lemma 
(8.57). By Proposition (8.15), we can find ¢ € C®(Q’ x 0’) such that ¢ = 1 
on a neighborhood of Ag: and supp ¢ is proper and contained in N. We 
then have 


PFu(x) = i; | e2™(F(2)—2) (F(x), €)u(F~1(z)) dz dé 


=| | etF(2)-FW)) En F(x), €)uly)| det Jr(u)| dy dé 
= Qu(z) + Ru(z), 
where 


Qu(z) = if i e2tin(eue—¥) € oF (x), €)u(y)| det Jp(y)|d(z, y) dy dé, 


Ru(z) = i) | e2(F(2)- FW) Ep (F(x), €)u(y)| det Jn(y)|[1 — o(z, y)] dy dé. 
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We first dispose of the error term R. With notation as in (8.6), its 
distribution kernel is 


K(z,y) = py (F(z), F(x) — F(y))| det Jr(y)|[1 — (2, y)]. 


Now, py (F(z), F(z)— F(y)) is smooth away from z = y by Theorem (8.8), 
and it has compact support as a function of y for each z since P is properly 
supported. Hence K € C® and K(z,-) € C& for each z, and it follows 
that the function 


r(ag)= fe EK(2, 2— 2) de 


belongs to S~™('). But R= r(z, D) by (8.6), so RE WM (0). 

Returning to the main term Q, let v(z,y) be the inverse transpose of 
p(z,y). Since we assumed m < —n, the double integral defining Qu(z) 
is absolutely convergent, so we may reverse the order of integration, make 
the substitution € = v(x, y)n, and restore the original order to obtain 


Qu(r) 
= [fore mnre, v(z,y)n)| det Jp(y)| | det v(x, y)|6(z, y)u(y) dy dn 


=/f e™(2-¥) a(x, 0, y)u(y) dy dn, 
where 


a(z,7,y) = p(F(z), v(z, y)n)| det Jr(y)|| det v(z, y)| A(z, y). 


A simple calculation involving the chain rule shows that a € A™(’). (The 
point to be noted is that applying, for example, D,, to p(F(x),v(z, y)n) 
gives Vyp( F(z), v(x, y)n) - [Dy;v(2,y)n); the first factor is dominated by 
(1+ |n|)™7}, and the 7 in the second factor brings it back to (1 + |n|)™.) 
Q us properly supported because of the factor of ¢ (cf. (8.23) and (8.24)), 
so it follows from Theorem (8.27) that Q € ¥™(’) and that 


oq(z,n) = p( F(z), (2, 2)n)| det Jp(z)||detv(z,z)| (mod sm’). 


But v(z,rz) = [J}(r)]~!, so the expression on the right reduces to 
p( F(z), [Jp(z)\~1n), and the proof is complete. 

This argument is still valid if m > —n, but the manipulation of the 
integral defining Q requires more justification then. Instead, we shall finesse 


Pseudodifferential Operators 313 


the problem as follows. Pick an integer M > }(n-+m) and let S € W-?™(Q) 
be a parametrix for AM (A = Laplacian) on Q. Then P = PSA” +T 
where T € W~™(Q). We can apply the preceding argument to the operators 
PS and T to see that (PS)* € ™™-?™(0') and TF € W-™(N’) and that 


ocpsyr (2, €) = p(F(x), (Jp(2)]7*€)(—407 [Jp (2)]1€[?)- 
(mod S™—-?™-1(9/)), 


But A” is a differential operator, so the elementary calculations of §1A 
show that (A™”)F is a differential operator on 9 and that its symbol 
is (—4x?|[J#(2)]~1€|?)*" modulo lower order terms. Since (PSAM)F = 
(PS)P(A™)?, the desired result follows from Corollary (8.38). ] 


Remark: This proof can obviously be pushed further to obtain a 
complete asymptotic expansion for the symbol of P’, but putting this 
expansion in a reasonably neat form requires more effort. The definitive 
result can be found in Hérmander [27, vol. III, Theorem 18.1.17]. 


Theorem (8.58) paves the way for defining pseudodifferential operators 
on manifolds. Namely, if M is a C° manifold, a linear map P : €/(M) + 
D’(M) is called a pseudodifferential operator of order m if for any coor- 
dinate chart V C M and any ¢, » € Co°(V), the operator Py yu = yP(¢u) 
is a WDO of order m with respect to some, and hence any, coordinate sys- 
tem on V. (More precisely, this means that if G: V — R” is a coordinate 
map then the transferred operator Poy u = [Pg,y(ueG)]oG—! belongs to 
w™(G(V)).) If M = Q is an open subset of R”, this class of operators is 
larger than &™(Q) in that it includes all smoothing operators. (If P is a 
smoothing operator and ¢,  € C?°(Q) then the distribution kernel of Py y 
belongs to C°, so it follows easily from (8.6) that P € ¥-™(Q). See also 
Exercise 3.) 

On a manifold M, one can define the symbol class S™(M) to be the 
set of functions on the cotangent bundle T* M that satisfy estimates of the 
form (8.2) in any local coordinate system. The precise symbol-operator 
correspondence only works in local coordinates, as the lower-order terms 
transform in complicated ways under coordinate changes, but by Theorem 
(8.58), the local symbols of a YDO on M determine a well-defined equiva- 
lence class in S™(M)/S™~'(M). This is enough to show that the notion of 
ellipticity at a point (z,€) € T* M is independent of the local coordinates. 
One can therefore define the characteristic variety of a ¥DO and the wave 
front set of a distribution on M, just as before, as closed conic subsets of 
T°M (the cotangent bundle of M with the zero section removed). 
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A slightly better global notion of symbol is available for operators with a 
principal symbol. If 2 Cc R" and P € ¥™(Q), P is said to have a principal 
symbol if there is a p” € C®(T°Q) such that p™(z, t€) = t™p™(z, €) for 
t > 0 and op — p™ agrees for large € with an element of S™-!(Q). (The 
restriction to large € is necessary since p™ is usually not C™ at € = 0.) For 
example, the principal symbol of a differential operator is, up to factors of 
2ni, what we called the characteristic form in §1A. Also, if P is elliptic with 
principal symbol p™, any parametrix for P has principal symbol 1/p™. It is 
an easy consequence of Theorem (8.58) that if P isa YDO on a manifold M 
that has a principal symbol in any local coordinate system, these symbols 
patch together globally to make a well-defined function on T*M. 

A more comprehensive account of these matters can be found in Treves 


[53, vol. I]. 


EXERCISES 


1. Suppose M is a smooth k-dimensional submanifold of R", o is surface 
measure on M, and f € C™(IR"). Compute the wave front set of the 
distribution u = f do (i.e, (u,¢) = f ¢f do). (Hint: Use Exercise 3 in 
§8G and Theorem (8.58).) 


The next exercise uses the following extension of Theorem (0.19), which 
may be proved using the ideas in the proof of Proposition (8.14) (see also 
Rudin [41, Theorem 6.20] and Folland [14, Exercise 4.56]): Suppose {Qa} 
is a collection of open sets in R™ and 2 = UQe. There exist sequences 
{¢;} and {y,;} in Co°(Q) such that (i) ~; = 1 on supp ¢;; (ii) each x; is 
supported in some 2,4; each z € 2 has a neighborhood that intersects only 
finitely many supp ¥j;; (iv) > ¢j = 1 on Q. 


2. Suppose {Q4} is a collection of open sets in R" and 2 = JN,. Suppose 
P : &'(Q) — D’(Q) is a linear map with the following properties: (i) 
For each a, the map Py : €’(Qa) > D/'(Q_) defined by Pyu = PulQ. 
belongs to ¥™(Qq). (ii) sing supp Pu C supp u for all u € €’(2). Show 
that P = Q+R where Q € ¥™(Q) and R is smoothing. (Hint: Let ¢; 
and y; be as in the remarks above, and let Qu = > ¥; P(¢;u).) 


3. Suppose 2 is an open set in R" and P : €'(Q) > D’(Q) is a linear 
map such that for any ¢,y € C°(Q), the map u + PP(¢@u) belongs 
to ¥™(Q). Show that P satisfies the hypotheses of Exercise 2 when 
{Qq} is the family of open subsets of 2 whose closure is compact and 
contained in 2. 
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4. Show by example that condition (ii) in Exercise 2 cannot be omitted. 
(Hint: Let Pu(x) = u(z + a) on Q =R", and take each Qq to be a set 
of diameter < |a|.) 
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W™(Q) (pseudodifferential operators), 268 
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Multi-indices and (Pseudo-) Differential Operators: 
D*, 216, 266 

p(x, D) (operator defined by a symbol), 267 
P, (operator defined by an amplitude), 284 
la], a!, c* (multi-index notation), 2 

A (Laplacian), 66 
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&, (normal derivative), 5 

&,—, (interior normal derivative), 116 

O,4, (exterior normal derivative), 117 
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Note: Symbols used only in a single section are generally not listed here. 
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difference quotient, 204 
differential operator, 31 
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Dirichlet integral, 88 
Dirichlet problem, 84, 117, 237 
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heat kernel, 143 
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Hilbert transform, 188 
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Holmgren uniqueness theorem, 54 
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interior Neumann problem, 117 
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Laplace operator, 34, 66 
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Laplacian, 66 

Lax-Milgram lemma, 248 
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Lewy equation, 56 
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linear differential equation, 31 
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local operator, 273 
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locally solvable operator, 58 
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mean value theorem, 69 

—, converse of, 70 
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Neumann problem, 84, 117, 241 
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oblique derivative problem, 140, 241 
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parabolic operator, 142 
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parametrix, 297-8 

partial differential equation, 30 
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Poisson kernel, 87, 93, 96 
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plane wave, 186 
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principal symbol, 314 
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properly elliptic operator, 261 
properly supported operator, 275 
pseudodifferential operator, 267 
—— on a manifold, 313 
pseudolocal operator, 273 


quasi-linear equation, 31 
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Riemann-Lebesgue lemma, 15 
Riesz potential, 151 
Riesz-Schauder theory, 24 
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Schwartz kernel theorem, 272 
segment property, 221 
self-adjoint problem, 250 
sharp Garding inequality, 302 
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smoothing operator, 274 


Sobolev lemma, 194, 225 
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spectral theorem, 28 
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strictly hyperbolic operator, 159 
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tempered distribution, 22 
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three lines lemma, 206 
transpose, 19, 289 
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uniqueness for Dirichlet problem, 72 


Vandermonde matrix, 223 
vector field, 3 


wave front set, 306 

wave operator, 34, 159 

weak solution, 31 

Weierstrass approx. theorem, 148 


Young’s inequality, 9, 10 
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